Abstract

The majority of labor transactions throughout much of history and a significant fraction of such transactions
in many developing countries today are "coercive," in the sense that force or the threat of force plays a
central role in convincing workers to accept employment or its terms. We propose a tractable principal-agent
model of coercion, based on the idea that coercive activities by employers, or "guns," affect the participation
constraint  of  workers.  We  show  that  coercion  and  effort  are  complements,  so  that  coercion  increases  effort. 
Nevertheless,  coercion  is  always  "inefficient,"  in  the  sense  of  reducing  utilitarian  social  welfare.  Better  outside 
options for workers reduce coercion, because of the complementarity between coercion and effort: workers with
better outside options exert lower effort in equilibrium and thus are coerced less. Greater demand for labor
increases coercion because it increases equilibrium effort. We investigate the interaction between outside options,
market prices, and other economic variables by embedding the (coercive) principal-agent relationship in
a  general  equilibrium  setup,  and  study  when  and  how  labor  scarcity  encourages  coercion.  We  show  that  general 
(market)  equilibrium  interactions  working  through  prices  lead  to  a  positive  relationship  between  labor  scarcity 
and  coercion  along  the  lines  of  ideas  suggested  by  Domar,  while  those  working  through  outside  options  lead  to  a 
negative  relationship  similar  to  ideas  advanced  in  neo-Malthusian  historical  analyses  of  the  decline  of  feudalism. 
A  third  effect,  which  is  present  when  investment  in  guns  must  be  made  before  the  realization  of  contracting 
opportunities,  also  leads  to  a  negative  relationship  between  labor  scarcity  and  coercion.  Our  model  also  predicts 
that  the  slave  trade  makes  slaves  worse  off,  conditional  on  enslavement,  and  that  coercion  is  more  viable  in 
industries  that  do  not  require  relationship-specific  investment  by  workers. 

Keywords:  coercion,  feudalism,  labor  scarcity,  principal-agent,  slavery,  supermodularity. 
"In the context of universal history, free labor, wage labor, is the peculiar institution" - M. I. Finley^1

1     Introduction 

The standard economic models of the labor market, regardless of whether they incorporate imperfections,
assume that transactions in the labor market are "free." For most of human history, however,
the  bulk  of  labor  transactions  have  been  "coercive,"  meaning  that  the  threat  of  force  was  essential  in 
convincing  the  worker  to  take  part  in  the  employment  relationship,  and  thus  in  the  determination  of 
his  or  her  compensation.  Slavery  and  forced  labor  were  the  most  common  form  of  labor  transactions  in 
most  ancient  civilizations,  including  Ancient  Greece,  Ancient  Egypt,  Rome,  several  Islamic  and  Asian 
Empires, and most known pre-Columbian civilizations (e.g., Meltzer, 1993, Patterson, 1982, Lovejoy,
2000,  Davis,  2006).  Slavery  was  also  the  basis  of  the  plantation  economies  in  the  Caribbean  (e.g., 
Curtin, 1990, Klein and Vinson, 2007), in parts of Brazil and Colombia, and in the Southern United States
(e.g., Patterson, 1982, Fogel and Engerman, 1974, Wright, 1978), while forced labor played a major
role in Spanish Latin America both in mining and in the encomiendas early on, and then in the subsequent
hacienda system that developed in much of Latin America (e.g., Lockhart and Schwartz, 1983,
Lockhart, 2000). While formal slavery has been rare in Europe since the Middle Ages, until the 19th
century, feudal labor relations, which include both forced labor services from serfs and various special
dues  and  taxes  to  landowners,  were  the  most  important  type  of  employment  relationship  except  in  the 
cities  (e.g.,  Blum,  1998).  Even  today,  the  United  Nations'  International  Labor  Organization  (ILO) 
estimates that there are over 12.3 million forced laborers worldwide (Andrees and Belser, 2009).^2

The  prevalence  of  slavery  and  forced  labor  in  human  history  raises  the  question  of  when  we  should 
expect  labor  to  be  transacted  in  free  markets  rather  than  being  largely  or  partly  coerced.  In  a 
seminal  paper,  Domar  (1970)  provided  one  answer:  slavery  or  serfdom  should  be  more  likely  when 
labor  is  scarce  so  that  (shadow)  wages  are  high.  This  answer  is  both  intuitive  and  potentially  in 
line  with  the  experience  in  the  Caribbean,  where  Europeans  introduced  slavery  into  islands  that  had 
their  population  decimated  during  the  early  phases  of  colonization.  In  contrast,  Acemoglu,  Johnson 
and Robinson (2002) suggest and provide evidence for the view that Europeans were more likely
to  set  up  labor-coercive  and  extractive  institutions  when  population  density  was  high  and  labor  was 
relatively  abundant.  Also  consistent  with  this  view,  though  working  through  a  different  mechanism 
and  in  a  different  context,  is  the  relationship  between  population  and  feudalism  emphasized  by  the 
famous  "neo-Malthusian"  theory  of  feudal  decline,  exemplified  by  Habakkuk  (1958),  Postan  (1973, 
2002), Le Roy Ladurie (1974), and, from a somewhat different perspective, North and Thomas (1971).
This  theory  claims  that  the  coercive  feudal  labor  relations  started  their  decline  when  labor  became 


^1 From Finley (1976), quoted in Engerman (1999), p. 275.

^2 The ILO estimates that of these 12.3 million, 20% are coerced by the state (largely into military service), 14% are
forced  sex  workers,  and  the  remaining  66%  are  coerced  by  private  agents  in  other  industries,  such  as  agriculture,  mining, 
ranching,  and  domestic  service  (Andrees  and  Belser,  2009).    Our  model  applies  most  directly  to  the  last  category. 


scarce following the Black Death and other demographic shocks that reduced population and raised
per  capita  agricultural  income  throughout  Europe  in  the  16th  century.  The  relationship  between  labor 
scarcity/abundance and coercion is also important in understanding the causes of the continued prevalence
of forced labor in many developing countries. There is also disagreement about the "efficiency" of
slavery and other forced labor arrangements. Fogel and Engerman (1974) famously claimed that
slavery in the US South was not unprofitable and that in fact it may have even been "efficient." This
view contrasts strongly with those of Genovese (1976) and Williams (1944).

In  this  paper,  we  develop  a  simple  model  of  coercive  labor.  In  partial  equilibrium,  our  model 
is a version of the principal-agent framework, with two crucial properties. First, the agent (worker)
has no wealth so that there is a limited liability constraint, and the principal can punish as well as
reward the agent. Second, the principal chooses the amount of "guns" (coercion), which influences
the reservation utility (outside option) of the agent.^3 The first of these changes has been explored in
several  papers  (e.g.,  Chwe,  1990,  Dow,  1993,  Sherstyuk,  2000).  The  second  is,  to  our  knowledge,  new, 
and  is  crucial  for  our  perspective  on  coercion  and  for  all  of  our  results;  it  captures  the  central  notion 
that  coercion  is  mainly  about  forcing  workers  to  accept  employment,  or  terms  of  employment,  that 
they would otherwise reject.^4

Our basic principal-agent model leads to several new insights about coercive labor relations. First,
we  show  that  coercion  always  increases  the  effort  of  the  agent,  which  is  consistent  with  Fogel  and 
Engerman's  view  that  Southern  slavery  was  productive.  In  fact,  we  show  that  coercion  may  increase 
the  level  of  effort  above  the  first  best  (i.e.,  that  which  would  have  been  observed  in  the  absence  of 
any  information  asymmetries).  Second,  we  show  that  coercion  is  always  "socially  inefficient,"  because 
it  involves  an  (endogenously)  costly  way  of  transferring  resources  (utility)  from  workers  to  employers. 
This result contrasts strongly with Fogel and Engerman's view that Southern slavery was "efficient."^5
Third, perhaps somewhat surprisingly, we find that workers with a lower (ex ante) outside option are
coerced  more  and  provide  higher  levels  of  effort  in  equilibrium.  The  intuition  for  this  result  illustrates 


^3 Throughout the paper, "guns" stand in for a variety of coercive tools that employers can use. These include the
acquisition  and  use  of  actual  guns  by  the  employers  as  a  threat  against  the  workers  or  their  families;  the  use  of  guards  and 
enforcers  to  prevent  workers  from  escaping  or  to  force  them  to  agree  to  employment  terms  favorable  to  employers;  the 
confiscation  of  workers'  identification  documents;  the  setting  up  of  a  system  of  justice  favorable  to  employers;  investment 
in  political  ties  to  help  them  in  conflictual  labor  relations;  and  the  use  of  paramilitaries,  strike-breakers  and  other  non- 
state  armed  groups  to  increase  their  bargaining  power  in  labor  conflicts.  In  all  of  the  instances  of  coercion  mentioned 
here,  for  example,  in  the  Caribbean  plantation  complex,  African  slave  trade,  the  mita,  the  encomienda,  the  feudal  system, 
and  contemporary  coercion  in  Latin  America  and  South  Asia,  employers  used  several  of  these  methods  simultaneously. 
^4 This view of coercion is consistent with the historical and contemporary examples given above as well as with the
1930 ILO Convention's definition of compulsory labor as "work or service which is exacted from any person under the
menace of any penalty and for which the said person has not offered himself voluntarily," (quoted in Andrees and Belser,
2009,  p.  179).  Discussing  contemporary  coercion,  Andrees  and  Belser  (2009)  write  that  "a  situation  can  qualify  as  forced 
labor  when  people  are  subjected  to  psychological  or  physical  coercion.  .  .  in  order  to  perform  some  work  or  service  that 
they  would  otherwise  not  have  freely  chosen,"  (p.  179). 

^5 It also offers a straightforward explanation for why gang labor disappeared after the Reconstruction, which was a
puzzle  for  Fogel  and  Engerman.  In  this  light,  our  model  is  much  more  consistent  with  Ransom  and  Sutch's  evidence  and 
interpretation  of  slavery  and  its  aftermath  in  the  South,  in  particular  with  their  view  that  agricultural  output  collapsed 
in  the  post-bellum  South  due  to  the  withdrawal  of  black  labor  (Ransom  and  Sutch,  1975,  1977,  1979).  See  also  Irwin 
(1994)  for  an  empirical  investigation. 


a  central  economic  mechanism:  in  our  model — and,  we  believe,  most  often  in  practice — effort  and 
coercion are "complements." When the employer wishes to induce effort, he finds it optimal to pay
wages  after  observing  high  output,  so  he  must  pay  wages  frequently  when  he  induces  high  effort. 
Greater  ex  ante  coercion  enables  him  to  avoid  making  these  payments,  which  is  more  valuable  when 
he  must  pay  frequently,  hence  the  complementarity  between  effort  and  coercion.  This  observation 
also  implies  that  more  "productive"  employers  will  use  more  coercion,  and  therefore  that  a  worker 
is  worse  off  when  matched  with  a  more  productive  firm.  This  contrasts  with  the  standard  results  in 
models  of  noncoercive  labor  markets,  where  ex  post  rent  sharing  typically  makes  workers  matched 
with  more  productive  employers  better  off.  It  also  implies  that  coerced  workers  may  receive  high 
expected  monetary  compensation,  despite  having  low  welfare,  which  is  consistent  with  the  findings 
of  both  Fogel  and  Engerman  and  the  ILO  that  coerced  laborers  often  receive  income  close  to  that  of 
comparable free laborers.^6

The  above-mentioned  partial  equilibrium  results  do  not  directly  answer  whether  labor  scarcity 
makes  coercion  more  likely.  To  investigate  this  issue,  and  the  robustness  of  our  partial  equilibrium 
results  to  general  equilibrium  interactions,  we  embed  our  basic  principal-agent  model  of  coercion 
in a general (market) equilibrium setting, with two distinct equilibrium interactions. The first is a
labor demand effect: the price an employer faces for his output is determined endogenously by the
production — and  thus  coercion  and  effort — decisions  of  all  employers  and  affects  the  marginal  product 
of  labor,  and  the  willingness  of  the  employer  to  use  coercion.  The  second  is  an  outside  option  effect: 
workers'  outside  options  affect  the  terms  of  their  contracts  and  the  level  of  coercion,  and  because  agents 
who  walk  away  from  a  coercive  relationship  (by  paying  the  cost  of  doing  so  determined  by  the  extent  of 
coercion) may match with another coercive employer, effective outside options are determined by the
overall  level  of  coercion  in  the  economy.  Because  of  the  complementarity  of  effort  and  coercion,  these 
effects  compete  with  each  other,  which  makes  the  characterization  of  general  equilibria  a  nontrivial 
task; formally, the set of equilibria may not be a lattice (e.g., Topkis, 1998), so it is impossible to apply
the  standard  methods  used  for  comparative  statics  (e.g.,  Milgrom  and  Roberts,  1994,  or  Acemoglu 
and  Jensen,  2009).  Nevertheless,  we  show  that  a  certain  type  of  extremal  equilibria  has  well  behaved 
comparative  statics,  and  we  characterize  these  and  show  that  they  corroborate  both  the  results  of  our 
principal-agent  model  and  the  results  obtained  in  the  case  where  only  one  of  the  general  equilibrium 
channels is present.

More importantly, the general equilibrium model highlights several channels linking labor
scarcity  to  coercion.  First,  the  labor  demand  effect  makes  coercion  more  likely  when  labor  is  scarce; 
this  is  reminiscent  of  Domar's  intuition,  whereby  greater  labor  scarcity  increases  prices,  and  via  this 
channel, raises the value of effort and thus encourages coercion.^7 Second, the outside option effect tends


^6 For example, the ILO constructed their 2009 estimate of the "Cost of Coercion" by estimating that forced workers
are paid 80% of the prevailing minimum wage (ILO, 2009).

^7 A recent paper by Naidu and Yuchtman (2009) exploits the effects of labor demand shocks under the Master and
Servant  Acts  in  19th-century  Britain  and  finds  evidence  consistent  with  this  labor  demand  effect. 


to  reduce  coercion  when  labor  is  scarce.  In  particular,  we  endogenize  outside  options  as  a  function 
of  labor  scarcity  and  show  that  greater  outside  options  lead  to  lower  equilibrium  effort  and  lower 
coercion.  This  is  similar  in  spirit  to  the  neo-Malthusian  theory  of  feudal  decline,  where  labor  scarcity 
increased  the  outside  opportunities  for  serfs,  particularly  in  cities,  and  led  to  the  demise  of  feudal  labor 
relations.  Interestingly  and  importantly,  our  theory,  though  consistent  with  the  neo-Malthusian  view, 
is not subject to Brenner's (1976) famous critique of this approach based on the observation that falling
populations  did  not  lead  to  a  decline  in  forced  labor  in  every  country  and  instance  during  this  era,  and 
in  particular  that  in  Eastern  Europe  they  were  associated  with  an  increase  in  the  prevalence  of  forced 
labor  in  the  so-called  "second  serfdom"  (see  Ashton  and  Philpin,  1985).  Our  model  in  fact  suggests 
that,  since  the  "second  serfdom"  appears  to  have  been  driven  by  the  increase  in  demand  for  agricultural 
goods  from  Western  Europe,  coercion  should  have  increased  during  this  episode.  Moreover,  the  main 
mechanism  that  could  have  acted  to  reduce  coercion  during  this  period,  the  increased  outside  option 
of  Eastern  European  workers,  was  probably  relatively  muted,  because,  in  contrast  to  Western  Europe, 
cities  were  less  important  as  alternative  employment  opportunities  for  East  European  serfs. 

Third,  we  also  identify  another  channel  via  which  labor  scarcity  reduces  coercion,  which  we  call  the 
economies  of  scale  effect.  The  main  idea  in  Acemoglu,  Johnson  and  Robinson  (2002)  is  that  setting  up 
coercive  and  extractive  institutions  becomes  more  profitable  when  there  are  more  numerous  laborers 
to exploit. This can be captured in our model when employers choose their investment in guns
("coerciveness") before matching: when labor is abundant, investing in coercion is more likely to be
useful,  so  coercion  increases. 

Finally,  the  general  equilibrium  analysis  also  enables  us  to  study  the  welfare  implications  of  "slave 
trade,"  whereby  employers  can  sell  their  coerced  agents  to  other  potential  employers.  We  show  that 
such  slave  trade  always  reduces  agent  welfare  and  may  reduce  social  welfare.  This  is  because  slave 
trade  shifts  the  productivity  distribution  of  active  employers  in  the  sense  of  first-order  stochastic 
dominance,  and  with  greater  productivity  comes  greater  coercion  and  lower  worker  utility. 

The  tractability  of  our  principal-agent  framework  also  enables  us  to  investigate  several  extensions. 
First,  we  introduce  ex  ante  investments  and  show  that  there  is  a  type  of  "holdup"  in  this  framework, 
where  agents  underinvest  in  skills  that  increase  their  productivity  in  their  current  coercive  relationship 
(since  their  utility  is  decreasing  in  their  productivity  with  their  current  employer)  and  overinvest  in 
skills  that  increase  their  outside  option  (since  this  makes  inducing  effort  more  costly  for  the  principal, 
which  leads  to  lower  coercion).  This  extension  provides  a  potential  (and  new)  explanation  for  why 
coercion  is  particularly  prevalent  in  effort-intensive,  low-skill  labor,  and  relatively  rare  in  activities 
that  require  investment  in  relationship-specific  skills  or  are  "care-intensive"  (as  argued  by  Fenoaltea, 
1984).  Second,  we  generalize  the  model  so  that  coercion  is  exercised  collectively  by  the  producers  (or 
by  groups  or  law  enforcement  agencies  under  their  control)  and  show  that  our  main  characterization 
and  comparative  static  results  extend  to  this  case.  Third,  we  investigate  the  consequences  of  a  different 
type of holdup that emerges from the presence of coercion: the inability of the employer to commit


to  payments,  especially  when  he  has  many  guns.  Fourth,  we  show  that  coercion  plays  a  more  useful 
role  when  it  affects  the  interim  outside  option  of  the  agent,  because  it  allows  the  principal  to  use  ex 
post punishments, which may be useful because of the presence of the limited liability constraints.
However,  we  also  show  that,  under  fairly  weak  assumptions,  this  effect  cannot  overturn  our  result 
that  coercion  is  socially  inefficient.  Finally,  we  consider  the  case  where  effort  and  productivity  are 
substitutes  in  a  producer's  production  function  and  show  that  in  this  case,  slave  trade,  conditional  on 
the  presence  of  coercion,  may  be  beneficial  by  both  increasing  the  productivity  of  active  employers 
and reducing coercion, thus increasing slave welfare.

Despite  the  historical  importance  of  coercion,  the  literature  on  coercive  labor  markets  is  limited. 
Early  work  in  this  area  includes  Conrad  and  Meyer  (1958),  Domar  (1970),  Fogel  and  Engerman  (1974), 
and Ransom and Sutch (1977). Bergstrom (1971) defines a "slavery equilibrium," a modification of
competitive equilibrium in which some individuals control the trading decisions of others, and shows
the existence and efficiency properties of slavery equilibria. Findlay (1975) presents a model in which slave
labor  is  produced  mechanically  by  wages  and  policing  of  slaves,  and  analyzes  how  long  it  takes  for 
a  slave  to  purchase  her  freedom.  Canarella  and  Tomaske  (1975)  also  view  a  slave  as  a  machine  that 
produces output as a function of payments and force. They suggest that the profits of slaveholders
are a poor proxy for slave welfare. Barzel (1977) performs a comparative analysis of slavery and free
labor, under the assumption that slaves, but not free workers, must be monitored, and that slaves
(exogenously) work harder. Fenoaltea (1984) posits that coercion can motivate "effort-intensive" but
not "care-intensive" work and provides a historical discussion of slavery consistent with this view.^8

In  more  recent  work,  Basu  (1986)  and  Naqvi  and  Wemhoner  (1995)  develop  models  in  which 
landlords  may  "coerce"  their  tenants  by  inducing  other  agents  to  ostracize  them  if  they  do  not  agree 
to  favorable  contract  terms  with  the  landlord.  Genicot  (2002)  develops  a  model  in  which  workers 
are  collectively  better  off  without  the  option  of  voluntarily  entering  into  bonded  labor  agreements,  as 
this stimulates the development of alternative forms of credit. Conning (2004) formalizes and extends
Domar's  hypothesis  in  the  context  of  a  neoclassical  trade  model  with  a  reduced-form  model  of  slavery, 
and  provides  extensive  historical  analysis.  Lagerlof  (2009)  analyzes  a  dynamic  model  of  agricultural 
development,  arguing  that  slavery  is  more  likely  at  intermediate  stages  of  development.  In  his  model, 
as  in  Domar's  paper,  labor  scarcity  makes  slavery  relatively  more  likely  than  free  labor. 

The  paper  most  closely  related  to  ours  is  Chwe's  (1990)  important  work  on  slavery.  Chwe  analyzes 
a  principal-agent  model  closely  related  to  our  partial  equilibrium  model.  There  are  several  differences 
between Chwe's and our approaches. First, his model has no general equilibrium aspects and does not
investigate  the  relationship  between  labor  scarcity  and  coercion,  which  is  one  of  our  central  objectives. 
Second,  and  more  importantly,  in  Chwe's  model,  the  principal  cannot  affect  the  agent's  outside  option, 
whereas  all  of  our  main  results  follow  from  our  fundamental  modeling  assumption  that  coercion  is 


^8 See also Engerman and Sokoloff (1997) for a historical analysis based on the view that certain tasks are inherently
better  suited  to  slave  labor.  Andrees  and  Belser  (2009)  also  argue  that  coercion  is  more  common  in  labor-intensive 
sectors  because  these  sectors  are  where  saving  on  labor  costs  is  most  important. 


about  affecting  the  outside  option  of  the  agent  (i.e.,  coercing  an  individual  to  accept  an  employment 
contract that he or she would not have otherwise accepted).^9 For example, this modeling assumption
is important for our results on efficiency (in Chwe's model, coercion is typically efficiency-enhancing).
Third, Chwe also imposes strong functional form assumptions to ensure the concavity of the principal's
problem, whereas we establish and then exploit the supermodularity of the principal's problem.^10 This
modeling  strategy  is  central  for  our  general  equilibrium  analysis. 

The  rest  of  the  paper  is  organized  as  follows.  Section  2  introduces  our  principal-agent  model  and 
the  main  features  of  the  general  equilibrium  environments.  Section  3  characterizes  the  solution  to  the 
principal-agent  problem  with  coercion  and  presents  several  key  comparative  static  results.  Section 
4 studies the general equilibrium model and investigates the relationship between labor scarcity and
coercion.  Section  5  presents  several  extensions.  Section  6  concludes.  Appendix  A  contains  the 
proofs  of  Propositions  1  and  2  and  Lemma  3  presented  in  Section  3.  Appendix  B  briefly  considers  a 
generalization  of  the  basic  principal-agent  model  introduced  in  the  next  section  to  include  multiple 
levels  of  output.  Appendix  C  establishes  the  existence  of  equilibrium  under  the  most  general  conditions 
considered  in  Section  4. 

2     Model 

In  this  section,  we  describe  the  environment.  We  start  with  the  contracting  problem  between  a 
"coercive"  producer  and  an  agent,  and  then  describe  how  market  prices  and  outside  options  are 
determined.  We  then  provide  a  brief  roadmap  of  the  results  that  will  be  presented  in  the  next  two 
sections.

2.1      The  Environment 

There  is  a  population  of  mass  1  of  (coercive)  producers  and  a  population  of  mass  L  <  1  of  agents 
(forced  laborers,  slaves,  or  simply  workers);  L  is  an  inverse  measure  of  labor  scarcity  in  the  economy. 
All agents are risk neutral. Each producer has a project that yields x units of a consumption good
if successful, where x has a distribution $F(x)$ with continuous density $f(x)$ and support $[\underline{x}, \bar{x}]$, with
$\underline{x} > 0$.^11 We refer to a producer with productivity x alternatively as a producer of type x. All projects
yield zero output if unsuccessful. All workers are identical.

Each  producer  is  initially  randomly  matched  with  a  worker  with  probability  L.  We  first  describe 
the  actions  and  timing  of  events  after  such  a  match  has  taken  place;  we  return  to  the  matching  stage 


^9 Interestingly, Chwe agrees with this perspective and writes "one forces another person to be her labourer if one does
so  by  (even  indirectly)  changing  her  reservation  utility,"  (p. 1110),  but  then  assumes  that  the  principal  cannot  change 
the  agent's  reservation  utility.  In  particular,  the  agent  in  his  model  has  exogenous  reservation  utility  U,  and  she  receives 
at  least  this  payoff  in  expectation  under  any  contract  she  accepts. 

^10 We do assume concavity of the principal's problem for one of our general equilibrium results (Proposition 5), imposing
identical  conditions  to  Chwe's  to  ensure  concavity. 

^11 The existence of a continuous density is assumed to simplify notation in the text and is relaxed in Appendix C.
None of the results presented here are affected if F has mass points or is Dirac at some x.


at the end of this section.

A producer with productivity x who is matched with an agent first chooses a level of guns, $g \geq 0$,
at private cost $\eta\chi(g)$, and simultaneously offers a "contract" specifying an output-dependent wage-punishment
pair $(w^y, p^y)$ for $y \in \{y_l, y_h\}$, where $y_l = 0$, and, for a producer with productivity x,
$y_h = x$.^12 We denote the wage-punishment pairs following high and low output by $(w^h, p^h)$ and
$(w^l, p^l)$, respectively. We assume that the producer is fully committed to this wage-punishment
pair (we return to this assumption in subsections 5.3 and 5.4). Wages and punishments have to be
nonnegative, i.e., $w^y \geq 0$ and $p^y \geq 0$, so the agent has "limited liability." We also assume that
$\chi(0) = 0$ and $\chi(\cdot)$ is twice differentiable, strictly increasing, and strictly convex, with derivative
denoted by $\chi'$ that also satisfies $\chi'(0) = 0$ and $\lim_{g \to \infty} \chi'(g) = \infty$. The parameter $\eta > 0$ corresponds
to  the  cost  of  guns,  or  to  the  cost  of  using  coercion  more  generally,  which  is  mainly  determined  by 
institutions,  regulations  and  technology  (e.g.,  whether  slavery  is  legal).  Following  the  contract  offer  of 
the  producer,  the  agent  either  accepts  or  rejects  the  contract.  If  she  rejects,  then  she  receives  payoff 
equal  to  her  (intrinsic)  outside  option,  u,  minus  the  level  of  guns,  g,  i.e., 

$$u - g,$$

and  the  producer  receives  payoff  0.  The  interpretation  of  the  agent's  payoff  is  that,  if  she  rejects 
the  contract,  the  principal  inflicts  punishment  g  on  her  before  she  "escapes"  and  receives  her  outside 
option, u.^13 This formulation introduces our main assumption that "coercion" (or "forced labor")
involves  using  force  or  the  threat  of  force  to  convince  an  agent  to  accept  an  employment  relationship 
that  she  might  have  rejected  otherwise.  In  light  of  this,  throughout  g  will  be  our  measure  of  coercion. 
We  can  also  think  of  u  —  g  as  the  agent's  "extrinsic"  outside  option,  influenced  by  the  coercion  of  the 
producer,  as  opposed  to  u,  which  could  be  thought  of  as  the  "intrinsic"  outside  option,  determined 
outside the coercive relationship (e.g., in general equilibrium).

If the agent accepts the contract offer, then she chooses effort level $a \in [0, 1]$ at private cost $c(a)$.
Here a is the probability with which the project succeeds, leading to output x. We assume that
$c(0) = 0$ and that $c(\cdot)$ is strictly increasing, convex, and twice differentiable, with derivative denoted
by $c'$, and we also impose that $\lim_{a \to 1} c'(a) = \infty$ to ensure interior solutions. Suppose that the market
price for the output of the producer is P. Thus when the agent accepts contract $(w^y, p^y)$ and chooses
effort a, guns equal g, and output equals y, the producer's payoff is

$$Py - w^y - \eta\chi(g),$$

and the agent's payoff is

$$w^y - p^y - c(a).$$
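
Because the project succeeds with probability a (so that y = x) and fails with probability 1 - a (so that y = 0), the realized payoffs above can be averaged over the two outcomes. The short derivation below spells out this step. It is only a sketch, written in the notation used in the text, with $(w^h, p^h)$ following high output and $(w^l, p^l)$ following low output.

```latex
\begin{align*}
\mathbb{E}[\text{producer's payoff}]
  &= a\,(Px - w^h) + (1-a)\,(P \cdot 0 - w^l) - \eta\chi(g) \\
  &= a\,(Px - w^h) + (1-a)\,(-w^l) - \eta\chi(g), \\
\mathbb{E}[\text{agent's payoff}]
  &= a\,(w^h - p^h) + (1-a)\,(w^l - p^l) - c(a).
\end{align*}
```

These expected payoffs are the expressions that enter the producer's objective and the agent's participation and incentive constraints in the contracting problem.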


^12 In subsection 4.5, we consider the alternative timing in which producers choose g before the matching stage.
^13 This formulation implies that g affects the agent's utility if she "escapes" rather than her utility when she accepts
the employment contract. Thus it is the threat of force, not its actual exercise, that matters. Nevertheless, since g is
chosen at the beginning, this threat is only credible (feasible) when the producer undertakes the investments in coercive
capacity ("guns").


These payoff functions also imply that punishment is costless for the producer, which is a simplifying
assumption. An equilibrium contract (for given market price, $P$, outside option, $\bar u$, and producer type,
$x$) is the subgame perfect equilibrium of the above-described game between the producer and the
agent.^{14} Given the timing of events, this equilibrium contract is given by the solution to the following
maximization problem (for a producer with productivity $x$):


$$\max_{(a,g,w^h,w^l,p^h,p^l) \in [0,1]\times\mathbb{R}_+^5} \; a\,(Px - w^h) + (1-a)\,(-w^l) - \eta\chi(g) \tag{1}$$

subject to

$$a\,(w^h - p^h) + (1-a)\,(w^l - p^l) - c(a) \ge \bar u - g, \tag{IR$_0$}$$

and

$$a \in \arg\max_{\tilde a \in [0,1]} \; \tilde a\,(w^h - p^h) + (1-\tilde a)\,(w^l - p^l) - c(\tilde a). \tag{IC$_0$}$$

Here (IR$_0$) can be interpreted as the "individual rationality" or "participation constraint" of the
agent. If this constraint is not satisfied, then the agent would reject the contract, that is, run away from
the match with the producer. (IC$_0$) is the "incentive compatibility" constraint, ensuring that $a$ is the
agent's best response in the subgame following the contract offer and her acceptance of the contract.
There is no loss of generality in letting the producer choose $a$ from the set of maximizers, since if
the producer expected an effort level choice from this set that did not maximize his payoff, then he
would have a profitable deviation. Thus only solutions to this program are consistent with subgame
perfect equilibria and all subgame perfect equilibria generate outcomes that constitute a solution to
this program.

2.2      Market  Interactions  and  General  Equilibrium 

To  complete  the  description  of  the  model,  we  next  describe  how  the  market  price,  P,  and  an  agent's 
outside  option,  u,  are  determined. 

As noted above, matching between agents and producers is random, so the distribution of productivity
among matched producers is identical to the unconditional distribution. Suppose that all
producers with productivity $x$ who are matched with agents choose coercion $g(x)$ and induce their
agents to exert effort $a(x)$.^{15} Then, average productivity among matched producers is

$$Q = \int_{\underline{x}}^{\bar{x}} a(x)\,x\,dF(x). \tag{2}$$


^{14} As our definition of general equilibrium below, Definition 1, makes clear, not every equilibrium contract may be
part of a (general) equilibrium, since the levels of $P$ and $\bar u$, which we take as given here, may not correspond to general
equilibrium values. One might thus alternatively refer to an equilibrium contract as an "optimal contract". We use the
term equilibrium contract throughout for consistency.

^{15} This implies that we are focusing on pure strategies. This focus is adopted in the text to simplify notation. Appendix
C allows for mixed strategies. We also allow such mixed strategies in subsection 4.2, where all producers are assumed to
have the same productivity, since in that context this assumption would be much more consequential. Also, note that
$a(x) = 0$ and $g(x) = 0$ are allowed. Below we impose Assumption 1, which ensures that the effort level is always positive
for all $x$.


The aggregate level of production is thus $QL$, and we assume that market price is given by

$$P = P(QL), \tag{3}$$

where $P(\cdot)$ is a decreasing demand schedule. Equation (3) captures the idea that greater output will
reduce price. Equation (2) makes the equilibrium price $P$ a function of the distribution of efforts
induced by producers.

An agent's outside option, $\bar u$, is determined according to a reduced-form matching model. When an
agent escapes, she either matches with another coercive producer or escapes matching with a coercive
producer altogether (e.g., running away to the city or to freedom in the noncoercive sector). We
assume that the probability that an agent who exercises her outside option matches with a randomly
drawn, previously unmatched, coercive producer is $\gamma \in [0,1)$, and the probability that she matches
with an outside, noncoercive producer is $1 - \gamma$. In this latter case, she obtains an "outside option"
$\tilde u$; we interpret $\tilde u$ as the wage in the noncoercive sector (and it will be endogenized in subsection 4.4
as a function of employment in the noncoercive sector). In practice, both the parameter $\gamma$ and the
exogenous outside option $\tilde u$ measure the possibilities outside the coercive sector. For example, both
in the context of feudalism and in the context of forced agricultural labor relations, the existence of
vibrant cities to which coerced workers may escape would correspond to a low $\gamma$ and a high $\tilde u$.

This formulation implies that the outside option of an agent matched with a coercive producer, $\bar u$,
satisfies

$$\bar u = \gamma \left( \int_{\underline{x}}^{\bar{x}} \left( \bar u - g(x) \right) dF(x) \right) + (1-\gamma)\,\tilde u. \tag{4}$$

Let $G$ be the average number of guns used by (matched) coercive producers. Then, once again
using the fact that the distribution of types of matched producers is the same as the unconditional
distribution (because of random matching), the average number of guns is

$$G = \int_{\underline{x}}^{\bar{x}} g(x)\,dF(x). \tag{5}$$

We refer to $G$ as the aggregate level of coercion in the economy. (4) can now be written as

$$\bar u = \tilde u - \frac{\gamma}{1-\gamma}\,G. \tag{6}$$

Intuitively, (6) states that an agent's outside option in the coercive sector equals her payoff from
exiting the coercive sector minus the aggregate level of coercion, $G$, as given by (5), times a constant,
$\gamma/(1-\gamma)$, that is increasing in the difficulty of exiting the coercive sector.
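
As a quick numerical check of the step from (4) to (6), the sketch below iterates on the fixed-point condition (4) for a hypothetical two-type economy and compares the result with the closed form (6). The type distribution, the values of $g(x)$, $\gamma$, and $\tilde u$, and all function names are illustrative assumptions and are not taken from the paper.

```python
def aggregate_coercion(g_values, weights):
    """Equation (5): G is the F-weighted average of g(x) over producer types."""
    return sum(w * g for w, g in zip(weights, g_values))


def outside_option_by_iteration(g_values, weights, gamma, u_tilde, tol=1e-12):
    """Iterate on equation (4): u = gamma * E[u - g(x)] + (1 - gamma) * u_tilde."""
    G = aggregate_coercion(g_values, weights)
    u = u_tilde  # starting guess; the map is a contraction because gamma < 1
    while True:
        u_next = gamma * (u - G) + (1.0 - gamma) * u_tilde
        if abs(u_next - u) < tol:
            return u_next
        u = u_next


def outside_option_closed_form(g_values, weights, gamma, u_tilde):
    """Equation (6): u = u_tilde - gamma / (1 - gamma) * G."""
    G = aggregate_coercion(g_values, weights)
    return u_tilde - gamma / (1.0 - gamma) * G


# Hypothetical two-type economy (all numbers are illustrative assumptions).
g_values = [0.1, 0.3]   # coercion chosen by each producer type
weights = [0.5, 0.5]    # probability mass of each type under F
gamma, u_tilde = 0.4, 1.0

u_iter = outside_option_by_iteration(g_values, weights, gamma, u_tilde)
u_closed = outside_option_closed_form(g_values, weights, gamma, u_tilde)
print(u_iter, u_closed)
```

The two numbers agree up to the iteration tolerance, and both lie below $\tilde u$ whenever $G > 0$ and $\gamma > 0$, which is the sense in which aggregate coercion lowers each agent's outside option.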

Given this description, we now define a pure-strategy (general or market) equilibrium for this
economy, referred to as equilibrium for short.^{16} As noted above, we focus on pure-strategy equilibria in
the text (except in subsection 4.2) to simplify notation, and relegate the discussion of mixed strategies
to Appendix C. In particular, pure strategies allow us to focus on the functions $a : [\underline{x}, \bar{x}] \to [0,1]$ and
$g : [\underline{x}, \bar{x}] \to \mathbb{R}_+$, which determine the level of effort induced and the level of coercion chosen by a
producer of type $x$, to represent equilibrium behavior.^{17}

^{16} The term "market equilibrium" may be more appropriate since we do not endogenize the function $P(\cdot)$.

Definition 1 A (pure-strategy) equilibrium is a pair of functions $(a^*(\cdot), g^*(\cdot))$ such that, for each
$x \in [\underline{x}, \bar{x}]$, $(a^*(x), g^*(x))$ is an equilibrium contract given market price $P$ and outside option $\bar u$, and
$P$ and $\bar u$ are given by (3) and (6) evaluated at $(a^*(\cdot), g^*(\cdot))$.

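Throughout, we impose the following joint restriction on $P(\cdot)$, $L$, $\underline{x}$, $\tilde u$, and $c(\cdot)$:

Assumption 1

$$P\left( L \int_{\underline{x}}^{\bar{x}} x\,dF(x) \right) \underline{x} > \tilde u + c'(0).$$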


Assumption  1  states  that,  even  if  all  producers  were  to  set  a  (x)  =  1  and  g  (x)  =  0,  the  marginal 
product  of  effort  of  an  agent  matched  with  the  lowest-productivity  producer  would  be  greater  than  the 
agent's  outside  option  plus  her  cost  of  effort  at  a  =  0.  Our  analysis  below  will  show  that  Assumption 
1  is  a  sufficient  (though  not  necessary)  condition  for  all  matched  producers  to  induce  their  agents 
to exert positive effort (i.e., generate positive expected output) in equilibrium. Therefore, imposing
this  assumption  allows  us  to  focus  on  the  economically  interesting  case  and  simplify  the  exposition 
considerably. 

2.3     Roadmap 

In the next section, we take the market price, $P$, and outside option, $\bar u$, as given and characterize
equilibrium contracts for a producer of type $x \in [\underline{x}, \bar{x}]$. We then turn to the characterization of
(general)  equilibrium  in  Section  4,  which  will  enable  us  to  discuss  issues  related  to  the  effects  of  labor 
scarcity  on  coercion,  as  well  as  to  verify  the  robustness  of  the  partial  equilibrium  effects  in  the  presence 
of  general  equilibrium  interactions. 

Several important points, which are central for our equilibrium results in Section 4, will be already
apparent from our "partial equilibrium" analysis. In particular, our model distinguishes between two
possible channels through which labor abundance/scarcity, $L$, will affect the equilibrium. The first is
the value of marginal product of effort, given by $P(QL)\,x$ for a producer of type $x$, and the second is
the agent's outside option, $\tilde u - \gamma G/(1-\gamma)$. The overall effect of labor scarcity will work through both of
these channels. For example, an increase in total labor (i.e., of agents that can be used in the coercive
sector) will directly reduce the market price $P(QL)$, and we will see in the next section (Proposition
2) that this will reduce the amount of coercion in equilibrium contracts for any type $x$. This channel
therefore captures Domar's basic insight that labor scarcity encourages coercion. In contrast, labor
scarcity may also affect $\bar u$, both via $\tilde u$ and via the aggregate level of coercion, $G$. For example, labor
scarcity may increase $\tilde u$, and thus, from (6), $\bar u$. Since one of our main results is that higher $\bar u$ (better
outside options) leads to lower coercion (see again Proposition 2), this general equilibrium mechanism
implies that labor scarcity will be associated with lower coercion, capturing the ideas suggested by
neo-Malthusian theories of the decline of feudalism, whereby greater outside wages make the coercion
of serfs more difficult to maintain.

^{17} More generally, strategies would induce a probability distribution over $a$ and $g$ for each type. Allowing for such
probability distributions is necessary for proving existence of equilibrium in the most general environment we consider
below and also becomes necessary when the distribution of productivity, $F$, has mass points (see Appendix C). Focusing
on pure strategies in the text saves on notation and simplifies the exposition.

3     Equilibrium  Contracts  and  Comparative  Statics 

3.1      Equilibrium  Contracts 

Recall that an equilibrium contract for a producer of type $x$ is a solution to (1) subject to (IR$_0$) and
(IC$_0$). Thus an equilibrium contract for a producer of type $x$ is simply a tuple $(a, g, w^h, w^l, p^h, p^l) \in
[0,1] \times \mathbb{R}_+^5$. Our first result provides a more tractable characterization of equilibrium contracts when
they involve positive effort ($a > 0$). Throughout the paper, we use the notation $[z]_+ = \max\{z, 0\}$.

Proposition 1 Suppose $Px > \bar u + c'(0)$. Then any equilibrium contract involves $a > 0$ and $g > 0$,
and an equilibrium contract for a producer of type $x$ is given by $(a, g, w^h, w^l, p^h, p^l)$ such that

$$(a,g) \in \arg\max_{(\tilde a, \tilde g) \in [0,1]\times\mathbb{R}_+} \; Px\,\tilde a - \tilde a \left[ (1-\tilde a)\,c'(\tilde a) + c(\tilde a) + \bar u - \tilde g \right]_+ - (1-\tilde a) \left[ -\tilde a\,c'(\tilde a) + c(\tilde a) + \bar u - \tilde g \right]_+ - \eta\chi(\tilde g) \tag{7}$$

with $w^l = p^h = 0$, $w^h = (1-a)\,c'(a) + c(a) + \bar u - g \ge 0$, and $p^l = a\,c'(a) - c(a) - \bar u + g \ge 0$.

Proof. See Appendix A. ■

Remark 1 The condition $Px > \bar u + c'(0)$ is automatically satisfied when Assumption 1 holds, since
$P \ge P\left( L \int_{\underline{x}}^{\bar{x}} x\,dF(x) \right)$, $x \ge \underline{x}$, and $\bar u \le \tilde u$. Thus Proposition 1 always holds under our maintained
assumption. The qualifier $Px > \bar u + c'(0)$ is added for emphasis, since, as the proof illustrates, it
ensures that $a > 0$, which is in turn important for this result. Problem (7) is not equivalent to the
maximization of (1) subject to (IR$_0$) and (IC$_0$) when the solution to the latter problem involves $a = 0$.

Remark 2 Proposition 1 states that $(1-a)\,c'(a) + c(a) + \bar u - g \ge 0$ and $-a\,c'(a) + c(a) + \bar u - g \le 0$
in any equilibrium contract; so in any equilibrium contract the right-hand side of (7) equals

$$Pxa - a(1-a)\,c'(a) - a\,c(a) - a\bar u + ag - \eta\chi(g). \tag{8}$$

Remark 3 The maximization problem (7) is (weakly) supermodular in $(a, g, x, P, -\bar u, -\eta)$. We show
in Appendix A (in particular, Lemma 7) that "generically" (or more precisely, for all parameter values,
except for possibly one value of each parameter), the expression $-a\,c'(a) + c(a) + \bar u - g$ is strictly less
than 0 in any equilibrium contract, and in this case, we show that (7) is strictly supermodular in
$(a, g, x, P, -\bar u, -\eta)$ in the neighborhood of any equilibrium contract. This gives us strict rather than
weak comparative statics when we consider changes between any two possible parameter vectors (see
Proposition 2 and Appendix A).

Proposition 1 simplifies our analysis considerably, since instead of solving a nonconvex constrained
maximization problem, we can focus on an unconstrained supermodular maximization problem.

To obtain an intuition for Proposition 1, suppose that the solution to (1) (subject to (IR$_0$) and
(IC$_0$)) indeed involves $a > 0$. Then recall that $c$ is differentiable and that the first-order approach
is valid in view of the fact that there are only two possible output realizations ($y \in \{y_l, y_h\}$), which
implies that, given the contract offer $(w^y, p^y)$, the agent's maximization problem in (IC$_0$) is concave.
Moreover, recall that $\lim_{a \to 1} c'(a) = \infty$, so the solution involves $a < 1$. This implies that (IC$_0$) can
be replaced by the corresponding first-order condition, where we write $u^h = w^h - p^h$ and $u^l = w^l - p^l$
for the agent's payoff (without effort costs) following the good and bad outcomes:

$$u^h - u^l = c'(a).$$

It wastes money for the producer to both punish and pay the agent after any outcome, so $p^h = 0$
if $u^h \ge 0$, $w^h = 0$ if $u^h \le 0$, $p^l = 0$ if $u^l \ge 0$, and $w^l = 0$ if $u^l \le 0$. Therefore, $w^h = [u^h]_+$ and
$w^l = [u^l]_+$, so (1) can be reduced to the following constrained maximization problem:

$$\max_{(a,g,u^h,u^l) \in [0,1]\times\mathbb{R}_+\times\mathbb{R}^2} \; a \left( Px - [u^h]_+ \right) - (1-a)\,[u^l]_+ - \eta\chi(g) \tag{9}$$

subject to

$$a\,u^h + (1-a)\,u^l - c(a) \ge \bar u - g \tag{IR$_1$}$$

and

$$u^h - u^l = c'(a). \tag{IC$_1$}$$


Next, using (IC$_1$) to substitute for $u^l$ in (IR$_1$) shows that this problem is equivalent to maximizing (9)
subject to

$$u^h - (1-a)\,c'(a) - c(a) \ge \bar u - g. \tag{IR$_2$}$$

Finally, using (IR$_2$) to substitute $u^h$ out of (9), and using (IR$_2$) and (IC$_1$) to substitute $u^l$ out of (9),
yields (7). Furthermore, it is intuitive that any solution to (7) will necessarily involve $w^l = 0$ and
$u^h \ge 0$ if $a > 0$, so that the contract does not punish the agent for a good outcome and does not
reward her for a bad outcome.
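
The reduction from (1) to (7) can be traced numerically. The sketch below takes a target effort level and a level of guns, recovers the agent's utilities after each outcome from the binding participation constraint and the first-order condition, splits them into wages and punishments, and then checks that the agent's best response and participation payoff come out as intended. The cost function $c(a) = -a - \ln(1-a)$ and the numerical values are assumptions chosen only because they satisfy the conditions imposed on $c(\cdot)$; the function names are likewise hypothetical.

```python
import math


def cost(a):
    """Assumed effort cost c(a) = -a - ln(1 - a): c(0) = 0, convex, c'(a) -> inf as a -> 1."""
    return -a - math.log(1.0 - a)


def marginal_cost(a):
    """c'(a) = a / (1 - a) for the assumed cost function."""
    return a / (1.0 - a)


def contract_from_effort(a, g, u_bar):
    """Recover the cheapest contract implementing effort a, given guns g.

    u_h comes from the participation constraint held with equality, u_l from
    the agent's first-order condition; each utility is then split into a wage
    or a punishment, never both after the same outcome.
    """
    u_h = (1.0 - a) * marginal_cost(a) + cost(a) + u_bar - g
    u_l = u_h - marginal_cost(a)
    return {
        "u_h": u_h, "u_l": u_l,
        "w_h": max(u_h, 0.0), "p_h": max(-u_h, 0.0),
        "w_l": max(u_l, 0.0), "p_l": max(-u_l, 0.0),
    }


def agent_best_response(u_h, u_l):
    """Solve u_h - u_l = c'(a) for a; with c'(a) = a / (1 - a) this is d / (1 + d)."""
    d = max(u_h - u_l, 0.0)
    return d / (1.0 + d)


a, g, u_bar = 0.5, 0.5, 0.5  # illustrative values, not taken from the paper
k = contract_from_effort(a, g, u_bar)
agent_payoff = a * k["u_h"] + (1.0 - a) * k["u_l"] - cost(a)
print(k, agent_payoff, agent_best_response(k["u_h"], k["u_l"]))
```

For these values the agent is paid only after success and punished only after failure, her expected payoff equals $\bar u - g$, and her best response reproduces the target effort.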

Note that (7) is not necessarily concave, so first-order conditions are not sufficient for characterizing
an equilibrium contract. Nevertheless, it is concave in $g$ for given $a$. This, combined with the
assumption that $\chi'(0) = 0$ and $\lim_{g \to \infty} \chi'(g) = \infty$, implies that the first-order condition with respect
to $g$, for a given equilibrium level of $a$,

$$\chi'(g) = \frac{a}{\eta}, \tag{10}$$

is necessary and sufficient whenever (7) is differentiable (i.e., whenever $u^l < 0$, which holds "generically,"
as shown in the proof of Proposition 2 and Lemma 7 in Appendix A). This immediately implies
that a producer that wishes to induce higher effort will use more guns. Put differently, (7) is supermodular
in $a$ and $g$.^{18} Though mathematically simple, this result is both important for our analysis
and economically somewhat subtle. One might have presumed that high effort might be associated
with less or more coercion. Our model implies that it will be associated with more coercion (but see
also subsection 5.5). The logic is as follows: (IR$_2$) implies that coercion is valuable to the producer
because it allows a one-for-one reduction in wages when the agent is successful (i.e., in $w^h$, since
$u^h = w^h$ in an equilibrium contract), regardless of effort. An agent who exerts high effort succeeds
more often and therefore must be rewarded more often. This makes coercion more valuable to the
producer.

This observation will play an important role in the rest of the analysis, not only because it will
lead to a positive covariation between $a$ and $g$, but also because it implies that the set of equilibrium
contracts will form a lattice (in the usual order on $\mathbb{R}^2$) and allows us to perform comparative static
analysis on the set of equilibrium contracts in the weak set order,^{19} even though (7) is not concave.
In particular, let $(a^+(x), g^+(x))$ and $(a^-(x), g^-(x))$ be, respectively, the greatest and the least
equilibrium contracts for a producer of type $x$ (given $P$, $\bar u$, and $\eta$). These are well-defined because,
given the supermodularity of (7) in $(a, g)$, the set of solutions is a lattice. The analysis of the behavior
of the set of equilibrium contracts in the weak set order is equivalent to the analysis of the behavior
of the pair of contracts $(a^+(x), g^+(x))$ and $(a^-(x), g^-(x))$ (provided that they move in the same
direction).^{20}

Next, recall from Remark 3 that (7) is also supermodular in $(a, g, x, P, -\bar u, -\eta)$ (and thus it exhibits
increasing differences in $(a, g)$ and $(x, P, -\bar u, -\eta)$). This implies that changes in productivity, $x$, market
price, $P$, outside option, $\bar u$, and cost of guns, $\eta$, will have unambiguous effects on the set of equilibrium
contracts. This observation enables us to derive economically intuitive comparative static results from
standard monotonicity theorems for supermodular optimization problems (e.g., Topkis, 1998). This
discussion thus establishes the following important result:

Proposition 2 The set of equilibrium contracts for a producer of type $x$ forms a lattice, with greatest
and smallest equilibrium contracts $(a^+(x), g^+(x))$ and $(a^-(x), g^-(x))$. The extremal equilibrium
contracts $(a^+(x), g^+(x))$ and $(a^-(x), g^-(x))$ are increasing in $x$ and $P$ and decreasing in $\bar u$ and $\eta$.

Proof.  See  Appendix  A.   ■ 


^{18} Formally, (10) only shows that (7) is supermodular in $a$ and $g$ in the region where $u^l < 0$. However, direct inspection
of (7) shows that it is (weakly) supermodular everywhere.

^{19} Recall that $X \ge X'$ in the weak set order if for every $x \in X$ there exists $x' \in X'$ with $x \ge x'$, and for every $x' \in X'$
there exists $x \in X$ with $x \ge x'$.

^{20} If $X$ and $X'$ are lattices, $X \ge X'$ in the weak set order if and only if the highest element of $X$ is higher than the
highest element of $X'$, and the lowest element of $X'$ is lower than the lowest element of $X$.
The results contained in Proposition 2 are intuitive. Higher $x$ and $P$ both increase the value
of a successful outcome for the producer and thus the value of effort. Since effort and coercion are
complements, both $a$ and $g$ increase. This is the essence of the labor demand effect, which was
discussed in the Introduction and will be studied in greater detail in the next section. A higher cost
of guns, $\eta$, makes coercion less attractive, and thus tends to reduce $g$. The complementarity of effort
and guns then raises the marginal cost of inducing higher effort, since now the producer has to pay
higher wages to satisfy (IR$_2$). Thus an increase in $\eta$ reduces both $a$ and $g$. The intuition for the effect
of outside options is given in subsection 3.3.
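
The comparative statics in Proposition 2 can be illustrated with a brute-force grid search on (7). The sketch below is only an illustration under assumed functional forms, $c(a) = -a - \ln(1-a)$ and $\chi(g) = g^2/2$, which satisfy the conditions imposed on $c(\cdot)$ and $\chi(\cdot)$ but are not taken from the paper. The parameter values are also hypothetical, and they are chosen so that the maximizer lies in the region $u^l < 0$, where (7) coincides with (8).

```python
import math


def cost(a):
    """Assumed effort cost c(a) = -a - ln(1 - a); not a functional form from the paper."""
    return -a - math.log(1.0 - a)


def marginal_cost(a):
    """c'(a) = a / (1 - a), which tends to infinity as a -> 1."""
    return a / (1.0 - a)


def objective(a, g, P, x, u_bar, eta):
    """Producer's objective in (7), with the assumed gun cost chi(g) = g**2 / 2."""
    u_h = (1.0 - a) * marginal_cost(a) + cost(a) + u_bar - g  # agent utility after success
    u_l = -a * marginal_cost(a) + cost(a) + u_bar - g         # agent utility after failure
    return (P * x * a - a * max(u_h, 0.0) - (1.0 - a) * max(u_l, 0.0)
            - eta * g ** 2 / 2.0)


def equilibrium_contract(P, x, u_bar, eta, n_a=199, n_g=201, g_max=2.0):
    """Grid search for a maximizer (a, g) of (7) over [0, 0.99] x [0, g_max]."""
    a_grid = [0.99 * i / (n_a - 1) for i in range(n_a)]
    g_grid = [g_max * j / (n_g - 1) for j in range(n_g)]
    return max(((a, g) for a in a_grid for g in g_grid),
               key=lambda ag: objective(ag[0], ag[1], P, x, u_bar, eta))


base = equilibrium_contract(P=2.0, x=1.0, u_bar=0.5, eta=1.0)
high_price = equilibrium_contract(P=3.0, x=1.0, u_bar=0.5, eta=1.0)
low_outside = equilibrium_contract(P=2.0, x=1.0, u_bar=0.1, eta=1.0)
high_gun_cost = equilibrium_contract(P=2.0, x=1.0, u_bar=0.5, eta=1.5)

for name, (a, g) in [("base", base), ("higher P", high_price),
                     ("lower u_bar", low_outside), ("higher eta", high_gun_cost)]:
    print(f"{name:12s} a = {a:.3f}, g = {g:.3f}")
```

In this parameterization, a higher price or a worse outside option raises both effort and guns relative to the baseline, while a higher cost of guns lowers both, in line with the directions stated in Proposition 2. A grid search is used instead of first-order conditions because (7) need not be concave.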

3.2      Discussion  of  Assumptions 

It  is  useful  to  briefly  discuss  the  role  of  various  assumptions  in  leading  to  the  sharp  characterization 
result  in  Proposition  1  and  to  (7),  which  will  play  a  central  role  in  the  rest  of  our  analysis.  Eight 
assumptions deserve special mention. First, we assume that the coercive relationship starts with a
match  between  the  producer  and  the  agent,  and  the  only  reason  for  the  producer  to  offer  an  "attractive" 
contract  to  the  agent  is  to  prevent  her  from  running  away.  This  is  important  for  our  analysis,  since 
it  implies  that  producers  do  not  compete  with  each  other  in  order  to  attract  agents.  We  believe  that 
this  is  a  realistic  assumption  in  the  context  of  coercion.  Serfs  in  Europe  and  forced  laborers  in  Latin 
America  were  often  tied  to  the  land  and  employers  did  not  need  to  attract  potential  workers  into 
serfdom.  Slaves  throughout  the  ages  were  often  captured  and  coerced.  According  to  Andrees  and 
Belser  (2009),  even  today  many  forced  employment  relationships  originate  when  employers  are  able 
to  lure  workers  into  such  relationships,  for  example  by  promising  good  working  conditions  that  do  not 
materialize  once  workers  arrive  at  a  plantation  or  mine,  at  which  point  they  are  not  allowed  to  leave. 
Second, we use a principal-agent model with moral hazard. Third, we impose a limited liability
constraint, so that the worker cannot be paid a negative wage. We view both of these assumptions as
central for a good approximation to actual coercive relationships. Inducing agents to exert effort is a
crucial concern in coercive employment relationships, and clearly these agents cannot make (unlimited)
payments to their employers, since they are trapped in the coercive relationship without other sources
of income. From a theoretical point of view, both of these assumptions are important for our results
(and we view this as a strength of our approach in clearly delineating results that depend on distinctive
features of coercive relationships). Relaxing either of these two assumptions would imply that the
employer could implement the "first-best" level of effort, $a_{FB}(x)$, given by $Px = c'(a_{FB}(x))$, either
by dictating it or by choosing large enough negative payments after low output (given risk neutrality).
In particular, in this case the problem of a coercive producer, with productivity $x$, could be written as
$\max_{(g, w^h)} a_{FB}(x)\,(Px - w^h) - \eta\chi(g)$ subject to $a_{FB}(x)\,w^h - c(a_{FB}(x)) \ge \bar u - g$. Since the constraint
will necessarily hold as equality, this problem can be written as

$$\max_{g \ge 0} \; a_{FB}(x)\,Px - \left( \bar u - g + c(a_{FB}(x)) \right) - \eta\chi(g).$$

This problem is no longer strictly supermodular and coercion will always be independent of both $\bar u$
and $P$. Therefore, all of our results depend on the principal-agent approach and the importance of
effort and moral hazard (and limited liability).
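
To see the independence result concretely, the sketch below solves the first-best problem above by grid search for two different combinations of $P$ and $\bar u$. It again uses the assumed forms $c(a) = -a - \ln(1-a)$ and $\chi(g) = g^2/2$ and hypothetical parameter values, none of which come from the paper.

```python
import math


def first_best_effort(P, x):
    """Solve P * x = c'(a) for the assumed cost c(a) = -a - ln(1 - a), so c'(a) = a / (1 - a)."""
    return P * x / (1.0 + P * x)


def coercion_without_moral_hazard(P, x, u_bar, eta, n_g=2001, g_max=2.0):
    """Grid search over g for the first-best problem, with assumed chi(g) = g**2 / 2."""
    a_fb = first_best_effort(P, x)
    c_fb = -a_fb - math.log(1.0 - a_fb)
    g_grid = [g_max * j / (n_g - 1) for j in range(n_g)]

    def payoff(g):
        # a_FB * P * x - (u_bar - g + c(a_FB)) - eta * chi(g)
        return a_fb * P * x - (u_bar - g + c_fb) - eta * g ** 2 / 2.0

    return max(g_grid, key=payoff)


g_case_1 = coercion_without_moral_hazard(P=2.0, x=1.0, u_bar=0.5, eta=2.0)
g_case_2 = coercion_without_moral_hazard(P=3.0, x=1.0, u_bar=0.1, eta=2.0)
print(g_case_1, g_case_2)
```

Both cases select the same level of guns, because $g$ enters the objective only through $g - \eta\chi(g)$; with the assumed quadratic cost this gives $g = 1/\eta$ regardless of $P$ and $\bar u$.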

Fourth, we allow the principal to use punishment $p \ge 0$. The presence of such punishments
is another realistic aspect of coercive relationships. Moreover, they play an important role in our
theoretical results by ensuring that the participation constraint, (IR$_0$) or (IR$_2$), holds as equality. In
the  absence  of  such  punishments,  the  participation  constraint  can  be  slack,  in  which  case  there  would  be 
no  role  for  using  our  measure  of  coercion,  g,  to  reduce  the  (extrinsic)  outside  option  of  the  agent.  One 
could  construct  different  versions  of  the  principal-agent  problem,  where  the  participation  constraint 
holds  as  equality  even  without  punishments,  and  we  conjecture  that  these  models  would  generate 
similar  insights.  Our  formulation  is  particularly  tractable  and  enables  us  to  focus  on  the  key  role  of 
coercion  in  inducing  agents  to  accept  employment  terms  that  they  would  not  have  done  in  the  absence 
of  coercion  or  threats  of  coercion.  We  also  view  a  situation  in  which  the  participation  constraint  of  the 
agent  is  binding  as  a  good  approximation  to  coercive  relationships;  if  the  participation  constraint  were 
slack,  we  could  not  talk  of  a  situation  in  which  coercion  is  used  for  making  agents  accept  employment 
terms  that  they  would  not  otherwise  accept.  Instead,  one  might  imagine  a  situation  in  which  an 
interim  participation  constraint  is  slack;  in  subsection  5.4,  we  show  how  this  can  be  incorporated  into 
our  framework  and  demonstrate  that  it  does  not  change  the  general  economic  insights  derived  in  the 
rest  of  the  analysis. 

Fifth, we impose Assumption 1 throughout. This assumption implies that productivity in the
coercive sector is (sufficiently) greater than $\tilde u$ and thus greater than agents' (intrinsic) outside option,
$\bar u$. This makes coercive relationships viable, and corresponds to situations in which coercive producers
have access to valuable assets for production, such as land or capital. This type of unequal access
to assets of production is a key feature supporting coercive relationships such as serfdom, forced labor,
or slavery.

Sixth,  we  assume  that  coercion  is  undertaken  by  each  producer,  and  thus  corresponds  to  the 
producer's  use  of  armed  guards,  enforcers  or  threat  of  violence  against  its  laborers.  In  practice, 
much  coercion  is  undertaken  jointly  by  a  group  of  producers  (for  example,  via  the  use  of  local  or 
national  law  enforcement,  or  the  judiciary  system,  as  was  the  case  in  the  US  South  both  before  and 
after the Civil War, e.g., Key, 1949, or Ransom and Sutch, 1977). Moreover, even coercion by each
individual  producer  presumes  an  institutional  structure  that  permits  the  exercise  of  such  coercion.  A 
comprehensive  study  of  coercion  requires  an  analysis  of  the  politics  of  coercion,  which  would  clarify 
the  conditions  under  which  producers  can  use  the  state  or  other  enforcement  mechanisms  to  exercise 
coercion  and  pass  laws  reducing  the  outside  option  of  their  employees.  Our  analysis  is  a  crucial 
step towards this bigger picture, since it clarifies the incentives of each producer to use coercion
before  incorporating  details  of  how  they  will  solve  the  collective  action  problem  among  themselves 
and  cooperate  in  coercive  activities.  To  highlight  that  this  first  step  captures  and  clarifies  much 
of  the  economics  of  coercion,  in  subsection  5.2,  we  show  that  our  major  comparative  static  results 
immediately generalize to an environment in which producers make voluntary contributions to a
collective  coercion  technology. 

Seventh, we assume bilateral risk-neutrality. The effects we focus on in this paper do not disappear
in the presence of risk-aversion, though adding risk-aversion complicates the analysis, as one might
expect. Nevertheless, there is at least one important way in which making the agent risk-averse
reinforces our central intuition that effort and coercion are complementary. Consider the case where
$u^l < 0$, so that the sole purpose of coercion is to reduce $w^h$. By (IR$_2$) and convexity of $c(\cdot)$, $u^h$ is
increasing in $a$ (for fixed $g$), and increasing $g$ allows the principal to reduce $u^h$ one-for-one. When the
agent is risk-averse, the wage that the producer must pay after high output to give the agent utility
$u^h$ is convex in $u^h$, since $u^h$ is a concave function of $w^h$. Therefore, reducing $u^h$ is more valuable to
the principal when $a$ is higher, which provides a second source of complementarity between $a$ and $g$
in the principal's problem. Therefore, our general presumption is that risk-aversion should strengthen
our results.^{21}

Finally, we assume only two levels of output. This is for simplicity, and Appendix B shows how
our results can be generalized, under (relatively mild) additional assumptions, to an environment with
multiple levels of output.^{22}

3.3     Further  Comparative  Statics 

Despite  the  large  historical  literature  on  slavery  and  the  importance  of  the  issue  of  coercion  in  labor 
markets  both  in  the  past  and  today,  little  is  understood  about  the  relationship  between  coercion  and 
other  economic  variables.  For  example,  does  labor  scarcity  increase  coercion?  Does  coercion  increase 
effort,  productivity  and  output?  Does  it  increase  or  reduce  "economic  efficiency"?  Do  better  outside 
options  for  workers  lead  to  more  coercion?^"^  Although  the  first  of  these  questions  must  await  our 
general  equilibrium  analysis  in  the  next  section,  Proposition  2  enables  us  to  provide  comprehensive 
answers  to  the  remaining  questions  and  others.  Throughout  this  subsection,  we  simplify  the  exposition 
by  simply  stating  that  "a  variable  z  increases  coercion  or  effort"  to  mean  that  the  greatest  and  least 


The reason that this argument is not completely general is that u may equal 0 in an equilibrium contract. However,
if the agent's utility function for money, u(w), satisfies u'''(w) < 0, it can be shown that the producer's problem, the
analogue of (7), is strictly supermodular in (a, g) regardless of u (proof available from the authors upon request). In
fact, the producer's problem remains strictly supermodular in this case even in the absence of limited liability and/or
punishments. Thus, risk-aversion provides an alternative route to our key comparative static results, provided one is
willing to make the somewhat unappealing assumption that u'''(w) < 0.

"'More  specifically,  we  show  that  provided  that  the  first-order  approach  is  valid  (which,  given  the  agent's  risk-neutrality, 
is  not  very  restrictive  when  we  use  Jewitt's  (1988)  sufficient  conditions),  the  multiple-output  analog  of  (7)  is  supermodular 
if the ratio of the likelihood ratio evaluated at the highest possible value of y to the likelihood ratio evaluated at the
lowest possible value of y is increasing in a. Intuitively, this condition guarantees that the effect that reducing w^h is more
valuable to the producer when a is high is not offset by the new possibility that an increase in a leads to a decrease in
the amount by which a given increase in g allows the principal to reduce w^h. The details are provided in Appendix B.

The  only  previous  answer  to  this  question  in  a  contract-theoretic  model  is  provided  by  Chwe  (1990),  who  shows 
that,  under  assumptions  that  ensure  concavity  of  the  producer's  problem,  better  outside  options  lead  to  higher  payoffs 
after  both  output  realizations  for  the  agent.  Because  higher  payoffs  are  associated  with  less  ex  post  punishment,  this  can 
be  interpreted  as  better  outside  options  leading  to  less  "coercion".  In  Chwe's  model,  this  result  depends  on  the  agent's 
risk-aversion (i.e., on "income effects").

equilibrium contracts, (a^+(x), g^+(x)) and (a^-(x), g^-(x)), are increasing in z.

First, Proposition 2 immediately formalizes the outside option effect on coercion discussed in the
Introduction (proof omitted):

Corollary 1 An agent with a worse (intrinsic) outside option (i.e., lower u) is subject to more coercion.

This result is at first surprising. Since higher g offsets the effect of higher u (recall (IR0) or
(IR2)), one might have expected g and u to covary positively. This presumption would also follow
from a possible, perhaps mistaken, reading of Domar (1970), on the hypothesis that labor scarcity
corresponds to higher u. Corollary 1 shows that the opposite is always the case. The intuition for
this result is interesting: An individual with a worse outside option (lower u) will be induced to work
harder, because her participation constraint, (IR2), is easier to satisfy. As discussed in the previous
subsection, agents working harder will be successful more often and will be paid more often. This
increases the value of coercion to the producer. Thus a lower outside option, u, is associated with
more coercion in equilibrium contracts.

Next, we turn to the implications of coercion for worker effort and productivity. To conduct
comparative statics with respect to coercion, we change the cost of coercion, η. Throughout the
paper, when we make comparisons between a coercive equilibrium contract ("coercion") and "no
coercion," the latter refers to a situation in which either we exogenously impose g = 0 or, equivalently,
η → ∞. Given this convention, the next corollary is also an immediate implication of Proposition 2
(proof omitted):

Corollary 2 Coercion (or cheaper coercion, i.e., lower η) increases effort.

This result may explain Fogel and Engerman's (1974) finding that productivity was high among
slaves in the US South in the antebellum period. It is also intuitive. Coercion and effort are
complements, and when the former becomes cheaper, equilibrium contracts induce greater effort. Conversely,
if coercion is banned or made prohibitively expensive, the producer will have to pay higher wages to
satisfy (IR0) and will choose to induce lower levels of effort.
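Corollaries 1 and 2 can be illustrated numerically. The following is a minimal sketch rather than the authors' method: it solves the producer's problem (7) by brute-force grid search under hypothetical functional forms, c(a) = -a - ln(1 - a) (so that c'(a) = a/(1 - a) diverges as a approaches 1) and χ(g) = g²/2, with illustrative parameter values Px = 2, u in {0.5, 1}, and η in {0.5, 5}. All function names and numbers are assumptions made for the illustration.

```python
import math

def cost(a):
    """Assumed effort cost c(a) = -a - ln(1 - a); c(0) = 0 and c is convex."""
    return -a - math.log(1.0 - a)

def cost_prime(a):
    """c'(a) = a / (1 - a), which diverges as a -> 1."""
    return a / (1.0 - a)

def producer_payoff(a, g, px, u, eta):
    """Objective (7) with the assumed gun cost chi(g) = g^2 / 2."""
    u_high = (1.0 - a) * cost_prime(a) + cost(a) + u - g   # utility after success
    u_low = -a * cost_prime(a) + cost(a) + u - g           # utility after failure
    # Wages are the positive parts of the promised utilities (limited liability).
    wage_bill = a * max(u_high, 0.0) + (1.0 - a) * max(u_low, 0.0)
    return a * px - wage_bill - eta * g * g / 2.0

def best_contract(px, u, eta, a_steps=100, g_steps=400, g_max=4.0):
    """Grid search for a maximizer (a, g) of the producer's problem."""
    best_value, best_a, best_g = -math.inf, 0.0, 0.0
    for i in range(a_steps):             # a in {0.00, 0.01, ..., 0.99}
        a = i / a_steps
        for j in range(g_steps + 1):     # g in {0.00, 0.01, ..., g_max}
            g = g_max * j / g_steps
            value = producer_payoff(a, g, px, u, eta)
            if value > best_value:
                best_value, best_a, best_g = value, a, g
    return best_a, best_g

# Baseline, a better outside option, and more expensive coercion.
a_base, g_base = best_contract(px=2.0, u=0.5, eta=0.5)
a_high_u, g_high_u = best_contract(px=2.0, u=1.0, eta=0.5)
a_costly, g_costly = best_contract(px=2.0, u=0.5, eta=5.0)

print("baseline:        a =", a_base, " g =", g_base)
print("higher u:        a =", a_high_u, " g =", g_high_u)
print("costlier guns:   a =", a_costly, " g =", g_costly)
```

Under these assumed forms, the grid maximizer uses more guns when u is lower, and induces more effort and more guns when η is lower, in line with the two corollaries. The grid resolution limits precision, so the output should be read qualitatively.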

The next corollary is immediate from the analysis in subsection 3.1 and shows that coercion is
unambiguously bad for the welfare of the agent.

Corollary 3 Coercion (or cheaper coercion, i.e., lower η) reduces agent welfare.

Proof. From the analysis leading up to Proposition 1, (IR0) holds as equality, and thus the welfare
of the agent under any equilibrium contract is equal to u − g. The result follows from the fact that g
is decreasing in η. ■


This contrasts with Chwe's (1990) framework, where the agent receives her outside option (reservation utility)
regardless of coercion.

Even though coercion reduces agent welfare, it may still increase some measures of "economic
efficiency" or net output. Fogel and Engerman (1974), in fact, not only documented that slaves in
the US South had relatively high productivity, but argued that the slavery system was "economically
efficient," perhaps in the sense of increasing either output net of effort costs or social welfare. Note
that these differ in our model since the former includes neither the cost of guns nor the welfare costs to
the agent of punishments; that is, net output is defined as Pxa − c(a), while (utilitarian) social welfare
is defined as Pxa − a(1 − a)c'(a) − ac(a) − au + ag − ηχ(g) + u − g (by (7), this is the sum of the
producer's and worker's utilities). We now examine how these quantities are affected by coercion. One
might conjecture that, since effort under moral hazard is typically below first-best, coercion, which
increases effort (Corollary 2), should increase an appropriate notion of economic efficiency. This is not
the case, however, as shown by the next two corollaries.

First, we show that coercion can lead to effort above the first-best level that would prevail in the
absence of information asymmetries and limited liability constraints (i.e., a_FB given by c'(a_FB) =
Px); this implies that coercion can in fact reduce net output. The argument leading to this result
is simple. Since lim_{a→1} c'(a) = ∞, the first-best effort, a_FB, is strictly less than 1. We show that
as η → 0, any equilibrium contract involves an effort level a arbitrarily close to 1 (see the proof of
Corollary 4). The intuition for this is that coercion allows the producer to "steal utility from the
agent," as shown by (IR0) or (IR2). Moreover, since the agent is subject to limited liability, the
transfer of utility from the agent to the producer will take place "inefficiently," by inducing excessive
effort. The next corollary formalizes this argument.

Corollary 4 There exists η** > 0 such that if η < η**, then any equilibrium contract involves effort
a strictly greater than a_FB.

Proof. From the proof of Proposition 2 in Appendix A, there exists η* > 0 such that u^l < 0 in any
equilibrium contract with a > 0 if η < η*. Therefore, if η < η*, an equilibrium contract involves an
effort level that solves

$$\max_{a \in [0,1]} \; Pxa - a\left[(1-a)c'(a) + c(a) + u - (\chi')^{-1}\!\left(\frac{a}{\eta}\right)\right] - \eta\,\chi\!\left((\chi')^{-1}\!\left(\frac{a}{\eta}\right)\right),$$

where we have used the fact that, when u^l < 0, χ'(g) = a/η. From Proposition 1, every solution a is
strictly positive, for all η > 0, and (1 − a)c'(a) + c(a) + u − (χ')^{-1}(a/η) > 0. By Proposition 2, the
smallest and greatest solutions are decreasing in η. For η > 0, let a(η) > 0 be the smallest solution,
and note that as η decreases, a(η) is nondecreasing, and thus remains uniformly bounded away from


Technically, the fact that coercion can lead to effort above the first-best level does not immediately imply that
coercion can reduce net output, as it is possible that effort without coercion is far below the first-best level. However,
the proof of Corollary 4 shows that, as η → 0, all equilibrium contracts under coercion have a → 1, which implies that
net output under coercion converges to −∞. So the statement that coercion can reduce net output is certainly true, and
in fact coercion must reduce net output if η is very low.

0. Next, since the maximand is right-differentiable and lim_{a→1} c'(a) = ∞, the first-order condition

$$(1 - a(\eta))\left(c'(a(\eta)) + a(\eta)\,c''(a(\eta))\right) + c(a(\eta)) + u = (\chi')^{-1}\!\left(\frac{a(\eta)}{\eta}\right) + Px$$

must hold. Now consider η → 0. For any a(η) < 1 the left-hand side is finite, while, since χ is convex
and satisfies lim_{g→∞} χ'(g) = ∞, the right-hand side converges to ∞. This implies that a(η) must
also converge to 1, and thus all solutions must converge to 1 as η converges to zero. Since a_FB < 1,
this completes the proof of the corollary. ■
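The limiting argument in this proof can be checked on a concrete case. The sketch below is only illustrative: it assumes the hypothetical forms c(a) = -a - ln(1 - a) and χ(g) = g²/2 (so that (χ')^{-1}(a/η) = a/η), takes Px = 2 and u = 0.5, and finds the root of the first-order condition by bisection. It does not verify the side conditions under which the first-order condition characterizes an equilibrium contract (such as u^l < 0); it only shows how the root moves toward 1 as η falls.

```python
import math

PX, U = 2.0, 0.5   # hypothetical values of Px and u

def foc_gap(a, eta):
    """Left-hand side minus right-hand side of the first-order condition,
    under the assumed forms c(a) = -a - ln(1 - a) and chi(g) = g^2 / 2."""
    c = -a - math.log(1.0 - a)
    c1 = a / (1.0 - a)              # c'(a)
    c2 = 1.0 / (1.0 - a) ** 2       # c''(a)
    lhs = (1.0 - a) * (c1 + a * c2) + c + U
    rhs = a / eta + PX              # (chi')^{-1}(a / eta) = a / eta
    return lhs - rhs

def foc_effort(eta, tol=1e-12):
    """Bisection on [0, 1): the gap is negative at a = 0 (since u < Px)
    and tends to +infinity as a -> 1."""
    lo, hi = 0.0, 1.0 - 1e-9
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if foc_gap(mid, eta) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# First-best effort solves c'(a) = Px, i.e. a / (1 - a) = Px.
a_fb = PX / (1.0 + PX)

etas = (1.0, 0.1, 0.01)
efforts = [foc_effort(eta) for eta in etas]
for eta, a in zip(etas, efforts):
    print("eta =", eta, " a(eta) =", round(a, 4), " above first best:", a > a_fb)
```

In this example the root lies below the first-best level when η = 1 and above it once η is small, which is the threshold behavior that Corollary 4 describes.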

The second part of the answer to the question of whether coercion can increase "economic
efficiency" involves looking at (utilitarian) social welfare. The next corollary shows that this measure of
social welfare is always lower under coercion.

Corollary 5 Social welfare in any equilibrium contract under coercion (η < ∞) is strictly lower than
social welfare in any equilibrium contract under no coercion.

Proof. Let (a, g) be the effort and coercion involved in an equilibrium contract under coercion. Let
SW^C be social welfare under coercion given (a, g), and let SW^N be the infimum over all equilibrium
contracts of social welfare under no coercion. We have

$$\begin{aligned}
SW^C &= Pxa - a(1-a)c'(a) - ac(a) - au + ag - \eta\chi(g) + u - g \\
&< Pxa - a(1-a)c'(a) - ac(a) - au + u \\
&\le \max_{\tilde{a} \in [0,1]} \; Px\tilde{a} - \tilde{a}(1-\tilde{a})c'(\tilde{a}) - \tilde{a}c(\tilde{a}) - \tilde{a}u + u \\
&= SW^N,
\end{aligned}$$

where the second and third lines are immediate since g > 0 by (10) and a < 1, and the fourth line
follows because the maximand in the third line is the same as the maximand in (7), i.e., as (8), with
g set to zero, which, by definition, characterizes the set of equilibrium contracts under no coercion. ■
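The chain of inequalities in this proof holds for any pair (a, g) with g > 0 and a < 1, not only for equilibrium contracts, because ag − ηχ(g) − g = −(1 − a)g − ηχ(g) < 0. The following sketch checks this on a grid. It is an illustration under assumptions that are not in the paper: c(a) = -a - ln(1 - a), χ(g) = g²/2, and the parameter values Px = 2, u = 0.5, η = 0.5; the grid maximum stands in for the maximum in the third line.

```python
import math

PX, U, ETA = 2.0, 0.5, 0.5   # hypothetical parameter values

def cost(a):
    """Assumed effort cost c(a) = -a - ln(1 - a)."""
    return -a - math.log(1.0 - a)

def cost_prime(a):
    """c'(a) = a / (1 - a)."""
    return a / (1.0 - a)

def welfare_without_guns(a):
    """Second line of the chain: Pxa - a(1-a)c'(a) - a c(a) - a u + u."""
    return PX * a - a * (1.0 - a) * cost_prime(a) - a * cost(a) - a * U + U

def welfare_with_coercion(a, g):
    """First line of the chain, with chi(g) = g^2 / 2 assumed."""
    return welfare_without_guns(a) + a * g - ETA * g * g / 2.0 - g

a_grid = [i / 100 for i in range(100)]       # a in [0, 0.99]; c diverges at a = 1
g_grid = [j / 100 for j in range(1, 301)]    # strictly positive gun levels

# Grid approximation to the maximum in the third line (welfare under no coercion).
sw_n = max(welfare_without_guns(a) for a in a_grid)

chain_holds = all(
    welfare_with_coercion(a, g) < welfare_without_guns(a) <= sw_n
    for a in a_grid
    for g in g_grid
)
print("inequality chain holds on the grid:", chain_holds)
```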

The intuition for Corollary 5 is simple: coercion is a costly means of transferring utility from the
agent to the producer. Therefore, it is necessarily overused in equilibrium. Despite the simplicity of
this intuition, the result contained in Corollary 5 has not appeared in the literature to the best of
our knowledge, because the central role of coercion in affecting the participation constraint has been
overlooked. This omission is also responsible for Fogel and Engerman's inference from the observation
that slaves exert high effort that the slavery system may have been "economically efficient". In fact,
the inefficiency of coercion in the labor market is stronger than that implied by Corollary 5. In the
next section, we show that coercion can be Pareto inefficient in the sense that not only social welfare
decreases, but the equilibrium allocation involves lower welfare both for producers and for agents
relative to an allocation where g = 0 (or where η → ∞).

Another immediate implication of Proposition 2 is the following (proof omitted):

Corollary 6 A coerced worker is better off when matched with a less productive producer (i.e., a
producer with lower x).

The intuition (and the proof) is simply that producers with higher x use more coercion and thus
give lower welfare, u − g, to their agents. Once again, though straightforward, this corollary has
interesting economic implications. One of these, which will be discussed further in the next section,
is that the slave trade makes agents worse off, even conditional on their being coerced.

Finally, we consider the cross-sectional relationship between coercion and expected incentive pay,
assuming that cross-sectional variation is generated by variation in x. Proposition 1 implies that an
equilibrium contract always involves w^l = 0 and

$$w^h = u - g + (1-a)c'(a) + c(a).$$

An increase in x leads to an increase in a, and affects w^h only through its effect on a. We have

$$\frac{dw^h}{da} = -\frac{dg}{da} + (1-a)c''(a) = -\frac{1}{\eta\,\chi''(g)} + (1-a)c''(a), \qquad g = (\chi')^{-1}\!\left(\frac{a}{\eta}\right).$$

The sign of this derivative is ambiguous: the direct effect of an increase in a is to increase w^h (as (IR2)
binds), but an increase in a also increases g (through (10)), which reduces w^h (again through (IR2)).
If the first effect dominates, then an increase in x leads to higher extremal values of g and w^h; if the
second effect dominates, then an increase in x leads to higher extremal values of g and lower extremal
values of w^h. The former case is particularly interesting because it provides an explanation for Fogel
and Engerman's observation that workers who are subjected to more coercion are not necessarily less
well-paid; and yet, in contrast to their interpretation, our result also shows that this has nothing to
do with the efficiency of slavery. We state this result in the next corollary (proof in the text):

Corollary 7 Cross-sectional variation in x leads to a positive correlation between (the greatest and
smallest equilibrium contract values of) g and w^h if dw^h/da > 0 for all a, and leads to a negative
correlation between (the greatest and smallest equilibrium contract values of) g and w^h if dw^h/da < 0
for all a.
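To see how the sign of dw^h/da can go either way, consider a small worked example. It is a sketch under hypothetical functional forms that are not taken from the paper: c(a) = -a - ln(1 - a), so that c''(a) = 1/(1 - a)², and χ(g) = g²/2, so that χ''(g) = 1. Under these assumptions the derivative reduces to -1/η + 1/(1 - a).

```python
def dwh_da(a, eta):
    """dw^h/da = -1 / (eta * chi''(g)) + (1 - a) * c''(a)
    with the assumed chi''(g) = 1 and c''(a) = 1 / (1 - a)^2."""
    gun_effect = -1.0 / eta            # higher effort raises g, which lowers w^h
    direct_effect = 1.0 / (1.0 - a)    # higher effort must be compensated
    return gun_effect + direct_effect

# Cheap coercion (eta = 0.5): the sign changes with the level of effort.
print(dwh_da(0.2, 0.5), dwh_da(0.8, 0.5))

# Expensive coercion (eta = 2): the direct effect dominates at every effort level.
always_positive = all(dwh_da(i / 100, 2.0) > 0 for i in range(100))
print(always_positive)
```

With these forms, the first case of Corollary 7 (a positive correlation between g and w^h) applies whenever η ≥ 1, while for η < 1 the sign depends on whether a lies above or below 1 − η.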

4     General  Equilibrium 

In  this  section,  we  characterize  and  discuss  the  comparative  statics  of  equilibria  as  defined  in  Definition 
1.  Our  main  objectives  are  twofold.  The  first  is  to  understand  the  relationship  between  labor  scarcity 
and  coercion,  which  was  one  of  our  main  motivating  questions.  The  second  is  to  investigate  the 
robustness  of  the  partial  equilibrium  insights  derived  in  the  previous  section,  including  those  concerning 
welfare.

We first recall that Assumption 1 ensures that

$$P(QL)\,x > \bar{u} + c'(0) = \tilde{u} - \frac{\gamma}{1-\gamma}G + c'(0)$$

for all $x \in [\underline{x}, \bar{x}]$. This implies that Proposition 1 applies and characterizes the set of equilibrium
contracts given P and ū (or, alternatively, G). The characterization of a (general) equilibrium then
simply amounts to finding the set of contracts (mappings) (a*(·), g*(·)) satisfying (7), where P and
ū are given by (3) and (6) evaluated at (a*(·), g*(·)).

At first, this appears to be a relatively simple equilibrium problem, since both endogenous (general
equilibrium) objects, P and ū, depend on the strategy profile of producers via two aggregates, Q and
G, as defined in (2) and (5). Therefore, this is an aggregative game (e.g., Acemoglu and Jensen, 2009).
However, in the most general case, the structure of equilibria in this game is complex, because the game
has multiple equilibria and exhibits neither strategic complements nor strategic substitutes. Loosely
speaking, when a set of producers choose higher (a, g), this increases both Q and G, but the increase
in Q reduces P (since the function P(·) is decreasing) and discourages others from increasing their
(a, g), while the increase in G reduces ū and encourages further increases in (a, g). These interactions,
in fact, imply that the set of equilibria may not be a lattice, making the characterization of equilibrium
comparative statics a challenging task.

This challenge notwithstanding, we provide a fairly tight characterization of the structure of
equilibria and their comparative statics, which generalize the results provided in Proposition 2. We also
show how our framework flexibly accommodates the three different channels from labor scarcity to
coercion discussed in the Introduction. We now provide a roadmap for the rest of the section. For the
first three subsections, we focus on the model as introduced in Section 2. In the next subsection, we
assume that only one of the two aggregates, Q and G, is relevant, in the sense that either P(·) = P
or γ = 0. In this case, the characterization of the set of equilibria becomes tractable. In subsection
4.2, we show that another special case, in which all producers have the same productivity x and (7) is
concave, also leads to sharp comparative static results. In subsection 4.3, we develop a more general
approach to the characterization of equilibria and provide comparative static results. These results
do not involve the analysis of extremal equilibrium (Q, G) pairs (as these may not exist), but focus on
extremal equilibria either for Q or for G, and show that equilibria always change in a way that shifts
these extremal equilibria unambiguously in the direction of the partial equilibrium results.

Beyond  proving  existence  of  equilibrium  and  generalizing  the  partial  equilibrium  results  of  the 
previous  section,  our  treatment  in  subsections  4.1-4.3  also  provides  the  first  answer  to  the  question 


"^In  the  first  case,  we  end  up  with  a  game  with  strategic  complements  (as  in  Milgrom  and  Roberts,  1994)  and  standard 
theorems  ensure  both  existence  of  equilibrium  and  comparative  statics  generalizing  the  partial  equilibrium  results  from 
the  previous  section.  In  the  latter  case,  we  have  an  aggregative  game  with  strategic  substitutes,  and  now  results  from 
Acemoglu  and  Jensen  (2009)  can  be  applied  to  derive  similar  results.  However,  instead  of  appealing  to  these  theorems, 
we  develop  a  new,  simple  approach  for  characterizing  extremal  equilibria  and  comparative  statics  in  both  cases.  This 
approach  is  useful  because  it  will  be  used  also  in  subsections  4.2  and  4.3,  where  neither  of  these  existing  methods  can  be 
used  for  analysis. 

Existence of equilibria in the general model of subsection 4.3 is established in Appendix C.

concerning the relationship between labor scarcity and coercion. Except in the case in which P(·) = P,
labor scarcity increases the market price for the output of the producers and thus creates a labor
demand effect, increasing coercion (thus formalizing and extending Domar's (1970) intuition). In
subsection 4.4, we modify our baseline model of Section 2 by allowing u to depend on employment
in the noncoercive sector. Labor scarcity now also reduces employment in the noncoercive sector
and tends to increase u via an (equilibrium) outside option effect. Now, supposing that P(·) = P,
comparative statics with respect to labor scarcity are reversed, because a more limited supply of labor
increases outside options and discourages coercion. This result formalizes and extends the intuition
of the neo-Malthusian theories of the decline of feudalism. In subsection 4.5, we consider another
modification of our original model in which producers choose guns before matching. In this case,
labor scarcity discourages coercion via an economies of scale effect, since a given investment in guns
is less useful when there is less labor to coerce (i.e., a lower probability of matching with an agent
that can be coerced). This mechanism is related to the ideas suggested in Acemoglu, Johnson and
Robinson (2002) for why Europeans were more likely to introduce coercive institutions in colonies
with greater population density. Finally, in subsection 4.6, we revisit the welfare analysis in general
equilibrium and show that in addition to the result that coercion reduces (utilitarian) social welfare, it
may in fact be "Pareto inefficient" (harming producers themselves as well as workers). We also show
that slave trade, even conditional on the presence of coercion, reduces the welfare of workers.

4.1      Single  Relevant  Aggregate 

In this subsection we derive comparative statics under the assumption that either P(·) = P (i.e.,
P(QL) = P for all Q) or γ = 0. This implies that only one of the two aggregates, Q and G, is relevant
in problem (7). Let (a^-, g^-)(x, Q, G) and (a^+, g^+)(x, Q, G) denote the smallest and greatest solutions
to (7), respectively, as functions of x, Q, and G.

Suppose that P(·) = P, so that (7) can be rewritten as

$$\max_{(a,g) \in [0,1] \times \mathbb{R}_+} \; aPx - a\left[(1-a)c'(a) + c(a) + \tilde{u} - \frac{\gamma}{1-\gamma}G - g\right]_+ - (1-a)\left[-ac'(a) + c(a) + \tilde{u} - \frac{\gamma}{1-\gamma}G - g\right]_+ - \eta\chi(g).$$

This does not depend on Q, so we may rewrite (a^-, g^-)(x, Q, G) and (a^+, g^+)(x, Q, G) as
(a^-, g^-)(x, G) and (a^+, g^+)(x, G). Note that Proposition 2 implies that both of these are increasing
in both x and G. Now define


$$\phi(G, P, \tilde{u}, \gamma, \eta) = \left[\int_{\underline{x}}^{\bar{x}} g^-(x,G)\,dF(x),\; \int_{\underline{x}}^{\bar{x}} g^+(x,G)\,dF(x)\right].$$

That is, φ(G, P, ũ, γ, η) is the interval with endpoints equal to the greatest and smallest aggregate
levels of coercion consistent with producers' offering equilibrium contracts given aggregate level of

Naturally, if both u(·) and P(·) are non-constant, then there will be two counteracting general equilibrium effects
and the overall impact of labor scarcity will be ambiguous.

coercion G. We will write φ(G) for φ(G, P, ũ, γ, η) when the parameters are understood. It is clear
that if G is an equilibrium aggregate level of coercion, then G is a fixed point of φ(G). The converse
is also true:

Lemma 1 If G ∈ φ(G), then G is an equilibrium aggregate level of coercion.

Proof. If G ∈ φ(G), then, by the Intermediate Value Theorem, there exists $x^* \in [\underline{x}, \bar{x}]$ such that

$$G = \int_{\underline{x}}^{x^*} g^-(x,G)\,dF(x) + \int_{x^*}^{\bar{x}} g^+(x,G)\,dF(x).$$

The strategy profile in which producers of type x < x* choose (a^-, g^-)(x, G) and producers of type
x > x* choose (a^+, g^+)(x, G) is an equilibrium. ■
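The cutoff construction in this proof can be made concrete. The sketch below is a toy illustration only: it holds G fixed, takes F to be uniform on [0, 1], and uses the hypothetical selections g^-(x) = x and g^+(x) = x + 1 in place of the smallest and greatest solutions to (7). It then finds the cutoff type x* by bisection, using the fact that the mixed aggregate falls continuously from the all-g^+ value to the all-g^- value as x* rises.

```python
def integrate(f, lo, hi, n=1000):
    """Midpoint rule for the integral of f over [lo, hi]."""
    if hi <= lo:
        return 0.0
    h = (hi - lo) / n
    return h * sum(f(lo + (k + 0.5) * h) for k in range(n))

def g_minus(x):
    """Hypothetical smallest solution for type x (G held fixed)."""
    return x

def g_plus(x):
    """Hypothetical greatest solution for type x (G held fixed)."""
    return x + 1.0

def mixed_aggregate(x_star):
    """Aggregate coercion when types below x_star play g_minus and types
    above x_star play g_plus, with F uniform on [0, 1]."""
    return integrate(g_minus, 0.0, x_star) + integrate(g_plus, x_star, 1.0)

def find_cutoff(target, tol=1e-10):
    """Bisection for x_star with mixed_aggregate(x_star) = target.
    mixed_aggregate is continuous and decreasing in x_star."""
    lo, hi = 0.0, 1.0
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mixed_aggregate(mid) > target:
            lo = mid      # aggregate too high: move more types to g_minus
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Any target between the all-g_minus and all-g_plus aggregates can be reached.
G_target = 1.0
x_star = find_cutoff(G_target)
print(x_star, mixed_aggregate(x_star))
```

In this toy case the mixed aggregate equals 1.5 − x*, so the bisection recovers the cutoff analytically implied by the target.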

The next lemma shows that if G_0 is the smallest [greatest] fixed point of φ(G), then G_0 is the
smallest [greatest] element of the set φ(G_0).

Lemma 2 If G^- is the smallest fixed point of φ(G), then $G^- = \int_{\underline{x}}^{\bar{x}} g^-(x, G^-)\,dF(x)$. If G^+ is the
greatest fixed point of φ(G), then $G^+ = \int_{\underline{x}}^{\bar{x}} g^+(x, G^+)\,dF(x)$.

Proof. Suppose G^- is the smallest fixed point of φ(G). Then $G^- \ge \int g^-(x, G^-)\,dF(x)$, since
g^-(x, G) is increasing in G and any other solution g(x, G) to (7) satisfies g(x, G) ≥ g^-(x, G). Thus
to obtain a contradiction, suppose that $G^- > \int g^-(x, G^-)\,dF(x)$. Since $\int g^-(x, G)\,dF(x)$ is
increasing in G and $\int g^-(x, 0)\,dF(x) > 0$, by Tarski's Fixed Point Theorem (e.g., Theorem 2.5.1 in
Topkis, 1998), there exists G' ∈ [0, G^-) such that $G' = \int g^-(x, G')\,dF(x)$, yielding a contradiction.

Next, suppose G^+ is the greatest fixed point of φ(G). Similarly $G^+ \le \int g^+(x, G^+)\,dF(x)$, thus to
obtain a contradiction, suppose that $G^+ < \int g^+(x, G^+)\,dF(x)$. Since lim_{g→∞} χ'(g) = ∞ and a < 1,
there exists Ḡ such that $\bar{G} > \int g^+(x, \bar{G})\,dF(x)$. So, since $\int g^+(x, G)\,dF(x)$ is increasing in G,
again by Tarski's Fixed Point Theorem, there exists G' ∈ (G^+, Ḡ) such that $G' = \int g^+(x, G')\,dF(x)$,
yielding another contradiction and completing the proof of the lemma. ■

Lemma 2 combined with supermodularity of (7) in (a, g) implies that corresponding to the smallest
[greatest] fixed point of φ(G), G^- [G^+], will be the smallest [greatest] aggregate output Q^- [Q^+]
corresponding to any fixed point of φ(G). Lemma 1 then implies that if G_0 is the smallest [greatest]
equilibrium value of G and Q_0 is the smallest [greatest] equilibrium value of Q, then there exists an
equilibrium with Q = Q_0 and G = G_0. It is then immediate from (7) that φ(G, P, ũ, γ, η) is increasing
in P and γ and decreasing in ũ and η, for all G. We can also obtain comparative static results with
respect to changes in the distribution of productivity, F. For this purpose, throughout the rest of
the paper, we use the first-order stochastic dominance order on the distributions F(·) (recall that
F'(x) first-order stochastically dominates F(x) if F'(x) ≤ F(x) for all x), and use "z is increasing in
F(·)" as a shorthand for "an increase in F(·) in the first-order stochastic dominance order implies
an increase in z". Given this convention, it is also straightforward that φ(G, P, ũ, γ, η) is increasing in
F(·). Furthermore, by (7) and the fact that ū is decreasing in G, φ(G, P, ũ, γ, η) is increasing in G.

Proposition 3 Suppose P(·) = P. Then an equilibrium exists and the set of equilibria is a lattice.
Furthermore, the smallest and greatest equilibrium aggregates (Q, G) are increasing in P, γ, and F(·),
and decreasing in ũ and η.

Proof. As noted in the proof of Lemma 2, $\int g^-(x, 0)\,dF(x) > 0$ and there exists Ḡ such that
$\bar{G} > \int g^+(x, \bar{G})\,dF(x)$. This implies that the smallest value of φ(0) is above the 45° line and the
greatest value of φ(Ḡ) is below the 45° line. Moreover, φ is monotone and has well defined smallest
and greatest values. Thus, by Tarski's Fixed Point Theorem, it has an intersection with the 45°
line between 0 and Ḡ, and from Lemma 1 this implies that an equilibrium exists; and the set of
intersections (and thus the set of equilibria) is a lattice. By Lemma 2 and the supermodularity of
(7), the smallest and greatest equilibria are well-defined. The argument in the preceding paragraph
establishes that φ(G) is increasing in G, P, γ, and F(·), and decreasing in ũ and η, for all G. Since
φ(G) is increasing, Theorem 2.5.2 in Topkis (1998) then implies that the smallest and greatest fixed
points of φ(G), and thus the smallest and greatest equilibrium aggregates, are also increasing in P,
γ, and F(·), and decreasing in ũ and η. ■
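The fixed-point logic behind this proof can be illustrated with a toy monotone map. The sketch below does not use the model's φ: it assumes a hypothetical increasing and continuous map phi(G, shift) = min(1, shift + G²), where `shift` stands in for any parameter that moves the map upward (such as P or γ in the proposition). Iterating upward from 0 reaches the smallest fixed point and iterating downward from an upper bound reaches the greatest one; convergence of these iterations relies on the assumed continuity, which Tarski's theorem itself does not require.

```python
def phi(G, shift):
    """Hypothetical increasing map; `shift` moves the whole map upward."""
    return min(1.0, shift + G * G)

def iterate_to_fixed_point(start, shift, tol=1e-12):
    """Monotone iteration G -> phi(G) from a starting point."""
    G = start
    while True:
        nxt = phi(G, shift)
        if abs(nxt - G) < tol:
            return nxt
        G = nxt

def smallest_fixed_point(shift):
    # Starting at the bottom (where phi(0) >= 0), the iterates increase
    # toward the least fixed point.
    return iterate_to_fixed_point(0.0, shift)

def greatest_fixed_point(shift, upper=2.0):
    # Starting at an upper bound (where phi(upper) <= upper), the iterates
    # decrease toward the greatest fixed point.
    return iterate_to_fixed_point(upper, shift)

low_01, high_01 = smallest_fixed_point(0.1), greatest_fixed_point(0.1)
low_02, high_02 = smallest_fixed_point(0.2), greatest_fixed_point(0.2)
print("shift 0.1:", low_01, high_01)
print("shift 0.2:", low_02, high_02)
```

For this map the smallest fixed point solves G = shift + G², so it rises with the shift, while the greatest fixed point stays at the cap of 1. Both extremal fixed points are therefore weakly increasing in the shift, which is the type of conclusion drawn from Theorem 2.5.2 in Topkis (1998).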

Note that Proposition 3 relies only on the supermodularity of (7) and the monotonicity of
(a^-, g^-)(x, Q, G) and (a^+, g^+)(x, Q, G) in G. In the converse case where P(QL) may depend
on Q, but γ = 0, so that (7) does not depend on G, an equivalent argument using monotonicity of
(a^-, g^-)(x, Q, G) and (a^+, g^+)(x, Q, G) in Q yields another analog of Proposition 2 (proof omitted):

Proposition 4 Suppose γ = 0. Then an equilibrium exists, and the set of equilibria is a lattice.
Furthermore, the smallest and greatest equilibrium aggregates (Q, G) are increasing in F(·) and decreasing
in L, ũ, and η.

Propositions  3  and  4  show  that  when  only  one  of  Q  and  G  is  relevant  in  (7),  an  equilibrium  exists 
and  its  comparative  statics  are  well-behaved — the  smallest  and  greatest  equilibrium  aggregates  are 
increasing  in  productivity  and  the  difficulty  of  leaving  the  coercive  sector,  and  are  decreasing  in  the 
outside  option  in  the  noncoercive  sector  and  the  cost  of  guns.  All  of  these  comparative  statics  are 
direct  consequences  of  the  comparative  statics  of  an  individual  producer's  problem  in  Proposition  2. 

In addition, as a first answer to our question concerning the relationship between labor scarcity
and coercion, Proposition 4 formalizes the labor demand effect on coercion, which implies that labor
scarcity leads to greater coercion as suggested by Domar. This is because in Proposition 2 both
coercion and effort are increasing in the price P, since greater demand for labor increases both, and
Proposition 4 then follows from the fact that P is decreasing in L (notably, this result does not apply
in Proposition 3, where P(·) = P). Intuitively, greater labor scarcity reduces production and increases
output price, and via this channel increases demand for labor and encourages coercion by individual
producers and in the aggregate.


In particular, Proposition 3 does not exploit the fact that, when P(·) ≡ P, the game is supermodular.

The comparative statics derived in this subsection do not generally hold when both Q and G are
relevant in (7). The next subsection provides one case in which both Q and G are relevant in (7) and
analogous results do hold, while subsection 4.3 considers the general case.

4.2 Homogeneous Producers

In this subsection, we simultaneously allow $P(QL)$ to depend on $Q$ and allow $\gamma$ to be positive,
but require that all producers are homogeneous, i.e., they all have the same productivity, $x$, and in
addition we assume that $c(\cdot)$ satisfies conditions sufficient to guarantee concavity of (7). Since
all producers have the same productivity in this subsection, we also relax the assumption that all
producers with the same productivity choose the same $(a, g)$ pair (see also Appendix C).

To ensure concavity, assume in this subsection only that $c$ is three times differentiable and that,
for all $a$, it satisfies

$$(1 - a)\, c'''(a) > c''(a). \tag{11}$$

This condition guarantees that the producer's problem (7) is strictly concave. It is a slight weakening
of the sufficient condition for concavity of the producer's problem imposed by Chwe. If $a$ is an
optimal effort for the producer, then concavity, (7), and (10) imply that $(a, (\chi')^{-1}(a/\eta))$ is the unique
solution to (7). Homogeneity of producers and strict concavity of (7) imply that all producers choose
the same equilibrium contract $(a, g)$ in any equilibrium. In particular, this implies that $a = Q$ and
$g = G$ in any equilibrium. This fact, together with the observation that $u^l < 0$ "generically" in partial
equilibrium, implies that $u^l < 0$ "generically" in general equilibrium (see Lemma 7 in Appendix A),
and thus we can write, from (10), $G = (\chi')^{-1}(Q/(x\eta))$. Now define

$$\phi(Q, x, \gamma, L, \bar u, \eta) = a\big(Q, (\chi')^{-1}(Q/(x\eta))\big),$$

so that $\phi(Q, x, \gamma, L, \bar u, \eta)$ is the unique aggregate level of output consistent with each producer choosing
an equilibrium contract given aggregate output $Q$ and the unique level of $G$ consistent with $Q$. The
main role of our assumptions here (homogeneous producers and concavity) is that they guarantee the
existence of such a unique level of $G$, by (10) and Lemma 7. We write $\phi(Q)$ for $\phi(Q, x, \gamma, L, \bar u, \eta)$
when the parameters are obvious.

It is clear that if $Q$ is an equilibrium aggregate level of output, then $Q$ is a fixed point of $\phi(Q)$; the
converse follows from an argument paralleling the proof of Lemma 1. Furthermore, $\phi(Q, x, \gamma, L, \bar u, \eta)$
is increasing in $\gamma$ and decreasing in $L$, $\bar u$, and $\eta$. To see this, note that the comparative statics for
$\gamma$, $L$, and $\bar u$ are immediate from (7); the comparative static for $\eta$ follows because $a$ is decreasing in $\eta$
and increasing in $G = Q/(x\eta)$. The comparative statics with respect to $x$ are ambiguous because $a$ is
increasing in both $x$ and $G = Q/(x\eta)$. Finally, since $G = Q/(x\eta)$ in any equilibrium, it is immediate
that the set of equilibrium $(Q, G)$ pairs forms a lattice.

To see why condition (11) ensures concavity, it is sufficient to differentiate (7) when $w^h > 0$ and $u^l < 0$.
Chwe also provides a justification for this apparently ad hoc assumption: define $f(\cdot)$ by
$f(-\log(1 - a)) \equiv c(a)$, so that $f(p)$ is the cost to the worker of ensuring success with probability
$1 - e^{-p}$; then one can verify that $(1 - a)\, c'''(a) > 2c''(a)$ if $f'''(\cdot) > 0$. Our condition simply
weakens this to $(1 - a)\, c'''(a) > c''(a)$.

Thus far, $\phi(Q)$ is similar to $\phi(G)$, defined in Section 4.1. An important difference, however, is
that $\phi(Q)$ may not be monotone, while monotonicity of $\phi(G)$ was the key to establishing Lemma
2, which showed existence of extremal $(Q, G)$ equilibria. However, this problem can be overcome in
the current setting thanks to two features. First, we already know that the set of equilibrium $(Q, G)$
pairs is a lattice. Second, we can appeal to the following lemma, which replaces the monotonicity
requirements in Lemma 2.

Lemma 3 Suppose $F(x, t) : [0, \bar x] \times \mathbb{R}^N \to [0, \bar x]$ is a non-empty, upper hemi-continuous, and
convex-valued correspondence that is increasing in $t$ in the weak set order for all $x$. Then the smallest
and greatest fixed points of $F(x, t)$ exist and are increasing in $t$.

Proof.  See  Appendix  A.    ■ 
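The content of Lemma 3 can be illustrated numerically in the special case of a continuous function. The following is only a sketch under stated assumptions: the map `F`, the interval $[0, 1]$, and the grid search are hypothetical choices made for illustration and are not part of the model. The map is deliberately non-monotone in $x$ but shifts up with $t$, and the extremal fixed points still move up with $t$.

```python
import math

def F(x, t):
    # Hypothetical continuous, non-monotone map of [0, 1] into itself;
    # it shifts up (weakly) as the parameter t increases.
    return min(1.0, max(0.0, 0.3 + 0.25 * math.sin(6.0 * x) + t))

def extremal_fixed_points(t, n=10001):
    # Grid approximation on [0, 1]:
    #   smallest fixed point = inf{x : F(x, t) <= x},
    #   greatest fixed point = sup{x : F(x, t) >= x}.
    # Both sets are non-empty because F maps [0, 1] into itself.
    grid = [i / (n - 1) for i in range(n)]
    smallest = next(x for x in grid if F(x, t) <= x)
    greatest = next(x for x in reversed(grid) if F(x, t) >= x)
    return smallest, greatest

ts = [0.0, 0.1, 0.2, 0.3]
results = [extremal_fixed_points(t) for t in ts]
for t, (lo, hi) in zip(ts, results):
    print(f"t = {t:.1f}: smallest ~ {lo:.4f}, greatest ~ {hi:.4f}")
```

Because raising $t$ shrinks the set where $F(x, t) \le x$ and enlarges the set where $F(x, t) \ge x$, the two approximations are weakly increasing in $t$ even though `F` is not monotone in `x`.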

Proposition 5 Suppose that all producers are homogeneous and (11) holds. Then, an equilibrium
exists, the set of equilibria is a lattice, and the smallest and greatest equilibrium aggregates $(Q, G)$ are
increasing in $\gamma$ and decreasing in $L$, $\bar u$, and $\eta$.

Proof. $\phi(0, x, \gamma, L, \bar u, \eta) > 0$ and $\phi(x, x, \gamma, L, \bar u, \eta) < x$, and $\phi(Q, x, \gamma, L, \bar u, \eta)$ is non-empty, upper
hemi-continuous and convex-valued by standard arguments. Therefore, the result follows immediately
from the fact that $\phi(Q, x, \gamma, L, \bar u, \eta)$ is increasing in $\gamma$ and decreasing in $L$, $\bar u$, and $\eta$, by Lemma 3. ■

Proposition 5 shows that comparative statics are almost exactly as in Propositions 3 and 4 when
producers are homogeneous and (11) holds, i.e., $(1 - a)\, c'''(a) > c''(a)$. The intuition is that, even
though both $Q$ and $G$ may now affect (7), the assumptions here are sufficient to ensure that the set
of equilibria is a lattice; in particular, the problem is effectively reduced to one where $Q$ is the only
aggregate, with $G$ fixed at $(\chi')^{-1}(Q/(x\eta))$ (from the first-order condition (10)).

The intuition for the ambiguous comparative statics with respect to $x$ is that, while an increase in $x$
increases $a$ for fixed $(Q, G)$, it also decreases $G$ for fixed $Q$, so the overall effect on $a$ of increasing
$x$ is ambiguous.

In Proposition 5, $\phi(Q, x, \gamma, L, \bar u, \eta)$ is a function of $Q$, not a correspondence, so the proof of
Proposition 5 relies only on the fact that Lemma 3 holds for continuous functions. However, the proof
for non-empty, upper hemi-continuous, and convex-valued correspondences is no harder than for
functions, so we state and prove the more general result in Lemma 3.

4.3 General Case

In this subsection, we consider the model in Section 2 without further restrictions; thus $P(QL)$ may
depend on $Q$, $\gamma$ is potentially positive, and producers are heterogeneous. In the text, we restrict
attention to pure strategies as specified in Definition 1 to simplify notation. To prove existence of
equilibria in this general setting, we need to consider mixed strategies; this is done in Appendix C.
Given a map $g(\cdot) : [\underline{x}, \bar x] \to \mathbb{R}_+$, let us define

$$G(g(\cdot)) = \int_{\underline{x}}^{\bar x} g(x)\, dF(x).$$

That is, $G(g(\cdot))$ is the aggregate level of guns when a producer with productivity $x$ chooses $g(x)$.
Next, we define

$$\phi(Q, \gamma, L, \bar u, \eta) = \left\{ Q' : \begin{array}{l} \text{there exists } (a(\cdot), g(\cdot)) \text{ such that } (a(x), g(x)) \text{ is a solution to (7)} \\ \text{given aggregates } (Q, G(g(\cdot))), \text{ and } Q' = \int_{\underline{x}}^{\bar x} a(x)\, x\, dF(x) \end{array} \right\}.$$

That is, $\phi$ maps $Q$ and parameter values to those $Q'$ that are equilibrium levels of output in the
modified model where price is fixed at $P(QL)$. By Lemma 1, the set of fixed points of $\phi$ coincides
with the set of equilibrium values of $Q$.

Lemma 4 The smallest and greatest elements of $\phi(Q, \gamma, L, \bar u, \eta)$ are increasing in $F(\cdot)$ and $\gamma$, and
decreasing in $Q$, $L$, $\bar u$, and $\eta$.

Proof. From Proposition 3, the smallest and greatest equilibrium aggregates $(Q', G')$ are increasing
in $P$, $F(\cdot)$, and $\gamma$, and decreasing in $\bar u$ and $\eta$, when price is fixed at $P$. Since the smallest and greatest
elements of $\phi(Q, \gamma, L, \bar u, \eta)$ are the smallest and greatest equilibrium levels of $Q'$ when price is fixed
at $P(QL)$, the results for $F(\cdot)$, $\gamma$, $\bar u$, and $\eta$ follow immediately from Proposition 3, and the results for
$Q$ and $L$ follow from Proposition 3 combined with the fact that $P(QL)$ is decreasing. ■

The next proposition shows that the comparative static results obtained so far continue to apply
when we focus on the smallest and greatest equilibrium values of output. Note, however, that this
proposition does not state that the set of equilibrium values of $(Q, G)$ is a lattice.

Proposition 6 The smallest and greatest equilibrium values of $Q$ are increasing in $F(\cdot)$ and $\gamma$, and
decreasing in $L$, $\bar u$, and $\eta$.

Proof. From Lemma 4, greater $F(\cdot)$ or $\gamma$, or lower $L$, $\bar u$, or $\eta$ shifts $\phi(Q, \gamma, L, \bar u, \eta)$ up for all $Q$.
Since $\phi(Q, \gamma, L, \bar u, \eta)$ is monotone in $Q$, Theorem 2.5.2 in Topkis (1998) implies that the smallest and
greatest fixed points of $\phi(Q, \gamma, L, \bar u, \eta)$ are increasing in $F(\cdot)$ and $\gamma$, and decreasing in $L$, $\bar u$, and $\eta$. ■

Next we can obtain the same results for the smallest and the greatest equilibrium values of $G$ by
interchanging the roles of $Q$ and $G$ (proof omitted).

Proposition 7 The smallest and greatest equilibrium values of $G$ are increasing in $F(\cdot)$ and $\gamma$, and
decreasing in $L$, $\bar u$, and $\eta$.
Therefore, whenever we focus on the smallest and greatest equilibrium values of either aggregate,
there are unambiguous comparative statics for that aggregate. Intuitively, this is because whenever
we take one of the two aggregates as a "parameter," the logic in subsection 4.1 applies and ensures
monotonicity of the producers' best responses. Nevertheless, this is not sufficient to give us the most
general comparative static results, because the set of equilibrium values of $(Q, G)$ may not be a lattice;
and thus one cannot state these results for the smallest and greatest values of the pair of equilibrium
aggregates $(Q, G)$.

Finally, the other relevant statement missing from Propositions 6 and 7 concerns the existence of an
equilibrium. As noted above, we return to this issue in Appendix C, where we use the distributional
approach to equilibrium in games with a continuum of players, originally developed by Mas-Colell
(1984), to establish existence of mixed-strategy equilibria, and also generalize Propositions 6 and 7 to
mixed-strategy equilibria.

4.4 Endogenous Wages in the Outside Sector

We have so far assumed that the wage that an agent receives in the noncoercive (outside) sector, $\bar u$,
is constant. This assumption, though convenient, removes a potential general equilibrium effect of
labor scarcity: when labor becomes more scarce, this not only changes its marginal product in the
coercive sector, but also in the noncoercive sector. This is one interpretation of the neo-Malthusian
theories of the decline of feudalism (e.g., Habakkuk, 1958, Postan, 1973, 2002, Le Roy Ladurie, 1974,
North and Thomas, 1976), where the mechanism via which the fall in population triggered the decline
of feudalism may have been the higher wages in the cities resulting from the overall labor scarcity in
the economy. We now extend our model to incorporate this (general equilibrium) outside option.

Outside options of coerced agents should naturally be related to opportunities, and thus to the
level of employment, in the noncoercive sector. We now introduce this feature and write this outside
option as $\bar u(L_N)$, where $L_N$ denotes employment in the noncoercive sector. We then endogenize $L_N$ in
the following natural way: suppose that total population is equal to $\tilde L < 1$, and suppose also that each
worker initially has probability $1 - \tilde\gamma$ of matching with the noncoercive sector, which could correspond
to "free cities," and has probability $\tilde\gamma$ of matching with a randomly drawn coercive producer. If a
worker separates from her employer, she then faces probability $\gamma$ of matching with a coercive producer
and probability $1 - \gamma$ of escaping to the noncoercive sector as before. (We allow for two different
matching probabilities, $\tilde\gamma$ and $\gamma$, in the first and second rounds of matching so as to separate the
effects of the two in the comparative statics.) This implies that, after the first round of matching, the
environment is identical to our benchmark economy, with $L = \tilde\gamma \tilde L$ agents matched with coercive
producers. Our analysis so far implies that, provided that Assumption 1 is satisfied, the participation
constraint of all agents matched with coercive producers will hold (as equality), so no agent will leave
her coercive employment relationship. Regardless of the value of $\bar u$, the remaining $(1 - \tilde\gamma)\tilde L$ agents are
in the noncoercive sector after the first round of matching, and they will also not leave for a second
round of matching (since this could result in matching with a coercive producer, giving them
necessarily lower utility than $\bar u$). Therefore, $L_N = (1 - \tilde\gamma)\tilde L$ is total employment in the noncoercive
sector. Suppose that the noncoercive sector has access to a technology of production $H_N(L_N)$ with
everywhere positive marginal product of labor but diminishing returns, that is, $H_N'(\cdot) > 0$ and
$H_N''(\cdot) < 0$. We also assume that workers in the noncoercive sector are paid their marginal product,
so that $\bar u(L_N) = H_N'(L_N)$. Since $L_N = (1 - \tilde\gamma)\tilde L$, the outside option in the noncoercive sector,
$\bar u((1 - \tilde\gamma)\tilde L)$, is strictly decreasing in $L_N$ and thus in $\tilde L$. To highlight the new mechanism at work
here, let us finally focus on the case where $P(\cdot) = P$, so that the Domar channel highlighted in the
previous three subsections is turned off. The next proposition characterizes the equilibrium and
provides the main comparative statics.

Proposition 8 Consider the modified model presented in this subsection and assume that $P(\cdot) = P$.
Then, an equilibrium exists and the set of equilibria is a lattice. Labor scarcity, corresponding to
a decline in $\tilde L$, reduces coercion. More formally, a decline in $\tilde L$ reduces the smallest and greatest
equilibrium aggregates $(Q, G)$. The remaining comparative statics in Proposition 3 continue to apply,
and thus the smallest and greatest equilibrium aggregates $(Q, G)$ are increasing in $P$, $\gamma$, and $F(\cdot)$, and
decreasing in $\bar u((1 - \tilde\gamma)\tilde L)$ and $\eta$.

Proof. $\tilde L$ enters the extended model only through $\bar u((1 - \tilde\gamma)\tilde L)$, so a decline in $\tilde L$ here is identical to an
increase in $\bar u$ in the model of subsection 4.1 with $P(\cdot) = P$. Then, from Proposition 3, an equilibrium
exists and the smallest and greatest equilibrium aggregates $(Q, G)$ are decreasing in $\bar u((1 - \tilde\gamma)\tilde L)$, so
are increasing in $\tilde L$. Also, again from Proposition 3, the smallest and greatest equilibrium aggregates
$(Q, G)$ are increasing in $P$, $\gamma$, and $F(\cdot)$, and decreasing in $\bar u$ and $\eta$. ■

This  proposition  shows  how  labor  scarcity  can  lead  to  lower  coercion  when  it  raises  the  outside 
option  of  workers.  The  most  relevant  application  of  this  proposition  is  to  situations  in  which  labor 
scarcity  increases  the  marginal  product  of  workers  in  noncoercive  sectors,  such  as  in  the  "free  cities" 
during  feudal  times  or  in  the  non-agricultural  activities  in  the  context  of  coercive  agricultural  labor 
relations.  When  this  happens,  labor  scarcity  raises  the  outside  option  of  coerced  agents  and  creates  a 
mechanism  towards  lower  coercion. 
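The outside-option channel can be made concrete with a small numerical sketch. Everything specific here is an assumption made for illustration: the technology $H_N(L_N) = L_N^{\alpha}$ with $\alpha = 0.5$, the first-round matching probability of 0.5, and the function and variable names are hypothetical and do not come from the model. The only properties used are a positive and diminishing marginal product.

```python
def outside_option(total_population, first_round_prob, alpha=0.5):
    # Hypothetical technology H_N(L_N) = L_N ** alpha with 0 < alpha < 1,
    # so the marginal product is positive and diminishing.
    # first_round_prob: probability of initially matching with a coercive producer.
    L_N = (1.0 - first_round_prob) * total_population   # noncoercive-sector employment
    return alpha * L_N ** (alpha - 1.0)                  # wage = marginal product H_N'(L_N)

populations = [0.9, 0.7, 0.5, 0.3]                       # labor becomes scarcer
options = [outside_option(L, first_round_prob=0.5) for L in populations]
for L, u in zip(populations, options):
    print(f"population {L:.1f}: outside option {u:.3f}")
```

As population falls, noncoercive employment falls and the marginal product (the outside option) rises, which is the force that lowers coercion in Proposition 8.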

Note that Proposition 8 clearly illustrates this mechanism, and thus the intuition implicit in
neo-Malthusian theories, because the Domar channel highlighted in Proposition 4 is turned off. If we were
to relax the assumption that $P(\cdot) = P$, then the implications of labor scarcity would be given by the
result of two competing effects. In general, this analysis highlights that the predictions concerning
whether labor scarcity will encourage or discourage coercion will depend on whether it mainly affects
the returns to producers from extracting effort or the outside option of the agents. This observation is
important in the context of Brenner's critique of neo-Malthusian theories. Brenner (1976) pointed out
that neo-Malthusian theories were inadequate, because in many instances, and most notably in the
episode of the so-called "second serfdom," population declines were associated with more, not less,
coercive relationships for peasants. Brenner's interpretation was that the neo-Malthusian approaches
were fundamentally inadequate because they ignored the essence of feudalism as a set of coercive
relationships. Instead, Brenner favored a Marxian theory based on class conflict (though his theory
is not fully articulated). Our model is consistent with the spirit of Brenner's objections to the
neo-Malthusian theories, since it explicitly models the coercive relationship (though presumably not in the
way that Brenner himself would have done). An interesting implication is that our model is potentially
consistent both with the decline of feudalism and the coercive labor relations that it entailed in Western
Europe following large declines in population and also with the increase in coercion in Eastern Europe
during the "second serfdom" episode: in the latter case, the decline in population was associated
with a significant increase in the demand for agricultural goods produced by East European workers,
coming from Western Europe, which translates into an increase in $P$ in the context of our model; and
it is plausible that the increase in the outside option of East European workers would have been much
more muted than that of their West European counterparts because of the relative paucity and weakness
of cities in this area (which in the context of the current model would correspond to higher $\gamma$ or a lower
$\bar u(\cdot)$ function in Eastern Europe than in the West). This discussion illustrates that the general model
that allows both $P(\cdot)$ and $\bar u(\cdot)$ to be non-degenerate functions can lead to significantly richer,
and more subtle, comparative statics, and also highlights that the predictions in general depend on
whether labor scarcity mainly affects the returns to producers from extracting effort or the outside
option of the agents.

In Proposition 8, the statement that the smallest and greatest equilibrium aggregates $(Q, G)$ are
decreasing in $\bar u((1 - \tilde\gamma)\tilde L)$ means that the greatest and smallest equilibrium aggregates decrease as
the function $\bar u(\cdot)$ evaluated at $(1 - \tilde\gamma)\tilde L$ increases.

4.5 Labor Scarcity and the Returns to Investment in Guns

In this subsection, we highlight another general equilibrium mechanism linking labor scarcity to
coercion, which will lead to comparative statics similar to those obtained in the previous subsection. The
underlying idea is related to Acemoglu, Johnson and Robinson's (2002) argument that labor coercion
is more profitable when there is abundant labor to coerce (because when labor is abundant, coercion
can exploit "economies of scale"). This channel is absent in our model so far, because each employer
employs (matches with) at most one worker and coercion decisions are taken conditional on the match.
An alternative timing of events is to assume that producers invest in guns before the matching stage.
This minor change in timing introduces the above-mentioned economies of scale effect and implies
that investment in guns will be less profitable when producers are relatively unlikely to match with
workers, that is, when labor is scarce. As in the previous subsection, to bring out this particular
general equilibrium effect, we abstract from Domar's mechanism by assuming that $P(\cdot) = P$.

The only difference from our baseline analysis is that producers choose $g$ before they learn whether
they are matched with an agent. This implies that they have a two-stage decision problem, whereby
they first choose $g$ before matching and then, after matching, they propose a contract to the agent,
which, as before, can be summarized by $(a, g)$. Even though this is a two-stage decision problem, there
is no loss of generality in formulating it mathematically as producers choosing $a$ and $g$ simultaneously
to solve

$$\max_{(a,g) \in [0,1] \times \mathbb{R}_+} \; L \left( aPx - a\left[(1 - a)c'(a) + c(a) + \bar u + \frac{\gamma}{1-\gamma} G - g\right]_+ - (1 - a)\left[-ac'(a) + c(a) + \bar u + \frac{\gamma}{1-\gamma} G - g\right]_+ \right) - \eta\chi(g),$$

with the interpretation that $a$ is their planned level of effort to induce if they match with an agent,
and where we have again substituted out the incentive compatibility and participation constraints
under Assumption 1. Rewriting this as

$$\max_{(a,g) \in [0,1] \times \mathbb{R}_+} \; aPx - a\left[(1 - a)c'(a) + c(a) + \bar u + \frac{\gamma}{1-\gamma} G - g\right]_+ - (1 - a)\left[-ac'(a) + c(a) + \bar u + \frac{\gamma}{1-\gamma} G - g\right]_+ - \frac{\eta}{L}\chi(g),$$

we see that changing the timing of the model by requiring producers to choose $g$ before learning
whether they are matched with an agent is formally identical to replacing the cost of guns $\eta$ with $\eta/L$.
This implies that the analysis of the set of equilibria is identical to that in Proposition 3, and thus we
immediately obtain the following result (proof omitted):

Proposition 9 Consider the modified model presented in this subsection and assume that $P(\cdot) = P$.
Then, an equilibrium exists and the set of equilibria is a lattice. Labor scarcity reduces coercion, that is,
a decline in $L$ reduces the smallest and greatest equilibrium aggregates $(Q, G)$. Moreover, the smallest
and greatest equilibrium aggregates $(Q, G)$ are increasing in $P$, $\gamma$, and $F(\cdot)$, and decreasing in $\bar u$ and
$\eta$.

Therefore, this proposition formalizes another channel via which labor abundance can encourage
coercion, and thus complements Proposition 8 in the previous subsection. As in the previous
subsection, if we relax the assumption that $P(\cdot) = P$, the implications of labor scarcity for coercion will be
given by competing effects.
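The equivalence between early investment in guns and an effective gun cost of $\eta/L$ can be checked numerically for a single producer. This is only a sketch under assumptions that are not part of this subsection: it borrows the functional forms of the example in the proof of Proposition 10 ($P = 1$, $\gamma = 0$, $c(a) = 1/(1 - a)$, $\chi(g) = g^2/2$, $\bar u = 1$, $\eta = 2$), takes only the high-output payment to be positive, and uses hypothetical names throughout.

```python
def planned_contract(x, L, eta=2.0, u_bar=1.0):
    # Assumed functional forms: P = 1, gamma = 0, c(a) = 1/(1 - a),
    # chi(g) = g**2 / 2, with only the high-output payment positive.
    kappa = eta / L                       # effective cost of guns when g is chosen early

    def foc(a):
        # Derivative in a of a*(x - u_bar) - 2a/(1 - a) + a**2 / (2*kappa),
        # the objective after substituting the optimal g = a / kappa.
        return (x - u_bar) - 2.0 / (1.0 - a) ** 2 + a / kappa

    lo, hi = 0.0, 0.9                     # foc(lo) > 0 > foc(hi) for the values used below
    for _ in range(100):                  # bisection on the first-order condition
        mid = 0.5 * (lo + hi)
        if foc(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    a = 0.5 * (lo + hi)
    return a, a / kappa                   # (planned effort, guns)

labor = [1.0, 0.75, 0.5, 0.25]            # labor becomes scarcer
contracts = [planned_contract(3.2, L) for L in labor]
for L, (a, g) in zip(labor, contracts):
    print(f"L = {L:.2f}: effort {a:.4f}, guns {g:.4f}")
```

In this sketch both planned effort and guns fall as $L$ falls, in line with the direction stated in Proposition 9 for the aggregates.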

4.6 Welfare Analysis in General Equilibrium

We now revisit the welfare results from Section 3.3, which showed that making coercion cheaper,
despite leading to higher effort, reduces utilitarian social welfare. We first show that this conclusion
still holds in general equilibrium. We then briefly investigate the implications of "slave trade," where
coerced workers can be sold from one producer to another. Finally, we show that coercion in general
equilibrium can lead to Pareto dominated outcomes.

To understand the general equilibrium welfare implications of coercion, first note that extremal
equilibrium values of $Q$ and $G$ are decreasing in $\eta$ (from Propositions 6 and 7, or from Propositions
3-5, when they apply). Second, we can observe that Corollary 4, which shows that any equilibrium
contract involves greater than first-best effort, continues to hold with endogenous $Q$ and $G$, replacing
"any equilibrium contract" with "the lower bound on equilibrium $Q$"; this follows from an argument
similar to the proof of Corollary 4.

Most importantly, we can also establish that the presence of coercion reduces (utilitarian) social
welfare for each matched producer-worker pair. Again from Proposition 6 (or from Propositions 3-5),
the smallest and greatest equilibrium values of $Q$, $Q^-$ and $Q^+$, are decreasing in $\eta$. Therefore,
taking $\eta$ to infinity, which is equivalent to setting $g = 0$ exogenously, reduces the smallest and greatest
equilibrium values of $Q$ to $Q^{-\prime} < Q^-$ and $Q^{+\prime} < Q^+$. Let $G^-$ be the smallest equilibrium value of $G$.
With coercion, the lower bound on equilibrium social welfare of a matched producer with productivity
$x$ and a corresponding matched agent is at most

$$\begin{aligned}
& a(x) P(Q^+ L) x - a(x)(1 - a(x))\, c'(a(x)) - a(x)\, c(a(x)) + (1 - a(x))\, \bar u \\
& \qquad - (1 - a(x)) \frac{\gamma}{1-\gamma} G^- - (1 - a(x))\, g(x) - \eta\chi(g(x)) \\
& < a(x) P(Q^+ L) x - a(x)(1 - a(x))\, c'(a(x)) - a(x)\, c(a(x)) + (1 - a(x))\, \bar u \\
& < \max_{\tilde a \in [0,1]} \; \tilde a P(Q^+ L) x - \tilde a (1 - \tilde a)\, c'(\tilde a) - \tilde a\, c(\tilde a) + (1 - \tilde a)\, \bar u \\
& \le \max_{\tilde a \in [0,1]} \; \tilde a P(Q^{+\prime} L) x - \tilde a (1 - \tilde a)\, c'(\tilde a) - \tilde a\, c(\tilde a) + (1 - \tilde a)\, \bar u,
\end{aligned}$$

where the last line is a lower bound on total welfare of the same pair of producer and agent in any
equilibrium with $g = 0$ (since, from Proposition 1, $w^h > 0$ when $g = 0$). Similarly, the upper bound
on total equilibrium welfare of the pair is at most

$$\begin{aligned}
& a(x) P(Q^- L) x - a(x)(1 - a(x))\, c'(a(x)) - a(x)\, c(a(x)) + (1 - a(x))\, \bar u \\
& \qquad - (1 - a(x)) \frac{\gamma}{1-\gamma} G^- - (1 - a(x))\, g(x) - \eta\chi(g(x)) \\
& < a(x) P(Q^- L) x - a(x)(1 - a(x))\, c'(a(x)) - a(x)\, c(a(x)) + (1 - a(x))\, \bar u \\
& < \max_{\tilde a \in [0,1]} \; \tilde a P(Q^- L) x - \tilde a (1 - \tilde a)\, c'(\tilde a) - \tilde a\, c(\tilde a) + (1 - \tilde a)\, \bar u \\
& < \max_{\tilde a \in [0,1]} \; \tilde a P(Q^{-\prime} L) x - \tilde a (1 - \tilde a)\, c'(\tilde a) - \tilde a\, c(\tilde a) + (1 - \tilde a)\, \bar u,
\end{aligned}$$

where the last line is now the upper bound on equilibrium total welfare of the pair when $g = 0$. This
establishes that the upper [lower] bound on equilibrium total welfare for each pair is less than the
upper [lower] bound on welfare when $g = 0$. Aggregating across all pairs implies that the upper [lower]
bound on total welfare in the society is less than the upper [lower] bound on total welfare without
coercion, which generalizes our partial equilibrium result (Corollary 5). We summarize this discussion
in the following corollary (proof in the text):

Corollary 8 Social welfare in any equilibrium under coercion ($\eta < \infty$) is strictly lower than social
welfare in any equilibrium under no coercion.

We next turn to the issue of "slave trade." We have so far assumed that agents are matched to
coercive producers, and contracting and production take place between this matched pair. This setup
thus allows labor coercion, but rules out "slave trade," whereby agents subject to coercion are sold
from one producer to another. We next investigate how this type of trading affects the equilibrium
and its welfare properties. (Note that this is different from the effect of "slave trade" that would bring
additional slaves from the outside, as the Atlantic slave trade did until its abolition in 1807. We show
that slave trade reduces worker welfare even if one ignores its effect on the number of coerced workers,
which is outside our model.)

The answer to this question is straightforward to develop given our analysis so far. Recall that
so far matched producers are a random selection from the population of producers, and thus their
productivity distribution is given by $F(x)$. Since $L < 1$, some producers are left unmatched. Now
suppose that, before the investment in guns, agents can be bought and sold among producers. Since
more productive (higher $x$) producers have a greater willingness to pay for coerced agents, this trading
will ensure that all of the agents will be employed by the most productive $L$ producers. Consequently,
the distribution of productivity among matched producers will be $\tilde F(x) = 0$ for all $x < x_{1-L}$ and
$\tilde F(x) = (F(x) - (1 - L))/L$ for all $x \geq x_{1-L}$, where $x_{1-L}$ is the productivity of the producer at the
$(1 - L)$th percentile of the productivity distribution. This implies that slave trade is equivalent to a
first-order stochastically dominating shift in the distribution of productivity in the context of our
model, and the analysis so far implies that this will raise coercion.
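The claim that reallocating agents to the most productive producers amounts to a first-order stochastically dominating shift can be checked on a finite population. The sketch below is illustrative only: the evenly spaced productivities, the population size, and all names are hypothetical. It compares the empirical distribution of matched producers under random matching (the same as the population distribution) with the distribution when only the top share $L$ of producers is matched.

```python
def ecdf(sample, x):
    # Empirical CDF: share of the sample with productivity <= x.
    return sum(1 for s in sample if s <= x) / len(sample)

n = 1000
L = 0.5                                           # mass of agents (share of producers matched)
producers = [(i + 0.5) / n for i in range(n)]     # hypothetical productivities in (0, 1)

# Random matching: matched producers are distributed like the whole population.
matched_random = producers
# Slave trade: the agents end up with the most productive share L of producers.
matched_trade = sorted(producers)[-round(L * n):]

# First-order stochastic dominance: the CDF under trade lies weakly below
# the CDF under random matching at every productivity level.
dominates = all(ecdf(matched_trade, x) <= ecdf(matched_random, x) for x in producers)
print("trade distribution dominates:", dominates)
```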

Now using this observation and the fact that agents matched to more productive producers have
lower welfare (Corollary 3), the next result shows that slave trade, even conditional on coercion,
increases the amount of coercion in equilibrium and reduces agent welfare. It may even reduce overall
welfare, despite the fact that agents are now allocated to the most productive producers.

Proposition 10 Introducing slave trade in the baseline model increases coercion ($G$) and reduces
agent welfare. More formally, the smallest and the greatest equilibrium levels of coercion [average
agent welfare] under slave trade are greater [smaller] than the smallest and the greatest equilibrium
levels of coercion [average agent welfare] under no slave trade. In addition, social welfare may decline
under slave trade.

Proof. Most of the proposition follows from the preceding discussion along with Proposition 7. The
only part to be proved is that overall welfare may decline under slave trade. We illustrate this using
a simple example, with two types of producers (see Appendix C for analogs of our earlier results in
the general case where the distribution $F$ can have mass points). Let $L = 1/2$, $P(Q) = 1$ for all $Q$,
$\gamma = 0$, $c(a) = 1/(1-a)$, $\chi(g) = g^2/2$, and $x = x_1$ or $x_2 > x_1$ with equal probability. Suppose that
Assumption 1 is satisfied, i.e., $Px_1 > \bar{u} + c'(0)$. Notably, there are no general equilibrium interactions
in this example, so the only effect of the slave trade is to shift a mass $1/4$ of workers from producers with
productivity $x_1$ to producers with productivity $x_2$. Therefore, it suffices to show that social welfare in
a producer-worker pair with productivity $x_1$ is higher than in a producer-worker pair with productivity
$x_2$. Using the fact that $g = a/\eta$, the maximization problem (7) for a producer with productivity $x$
can be written as

$$\max_{a \in [0,1]} \; xa - a\left(\frac{2}{1-a} + \bar{u} - \frac{a}{\eta}\right) - \frac{1}{2}\,\frac{a^2}{\eta}.$$

Now assuming in addition that $\bar{u} = 1$, $\eta = 2$, $x_1 = 3.1$ and $x_2 = 3.2$ (which clearly satisfy Assumption
1), we obtain that $a(x_1) \approx 0.028$, which yields producer welfare of approximately 0.0014, and worker
and total (utilitarian) welfare of approximately 0.99. In contrast, $a(x_2) \approx 0.052$, which yields producer
welfare of 0.005, worker welfare of approximately 0.97, and total welfare of approximately 0.98, which
is lower than the corresponding social welfare for $x_1 = 3.1$. This illustrates the possibility of slave
trade reducing total social welfare and completes the proof of the proposition. ■
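
The figures in this example can be checked with a short numerical sketch. It assumes the reduced objective $xa - a\,(2/(1-a) + \bar{u} - a/\eta) - a^2/(2\eta)$ implied by $c(a) = 1/(1-a)$, $\chi(g) = g^2/2$ and $g = a/\eta$, and it takes worker welfare to be $\bar{u} - g$. The function names and the bisection solver are illustrative choices rather than part of the paper; the computed values come out close to the rounded figures reported in the proof.

```python
def producer_objective(a, x, u_bar=1.0, eta=2.0):
    """Reduced producer objective from the example (assumed functional forms)."""
    return x * a - a * (2.0 / (1.0 - a) + u_bar - a / eta) - a * a / (2.0 * eta)


def marginal_objective(a, x, u_bar=1.0, eta=2.0):
    """Derivative of producer_objective with respect to effort a."""
    return x - u_bar - 2.0 / (1.0 - a) ** 2 + a / eta


def optimal_effort(x, u_bar=1.0, eta=2.0):
    """Solve the first-order condition by bisection (objective is concave for eta = 2)."""
    if marginal_objective(0.0, x, u_bar, eta) <= 0.0:
        return 0.0
    lo, hi = 0.0, 0.999
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if marginal_objective(mid, x, u_bar, eta) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def welfare(x, u_bar=1.0, eta=2.0):
    """Return (effort, producer welfare, worker welfare, total welfare)."""
    a = optimal_effort(x, u_bar, eta)
    producer = producer_objective(a, x, u_bar, eta)
    worker = u_bar - a / eta  # worker payoff is u_bar - g with g = a / eta
    return a, producer, worker, producer + worker


if __name__ == "__main__":
    for x in (3.1, 3.2):
        a, producer, worker, total = welfare(x)
        print(f"x={x}: a={a:.4f} producer={producer:.4f} "
              f"worker={worker:.4f} total={total:.4f}")
```

Under these assumptions total welfare is lower at $x_2 = 3.2$ than at $x_1 = 3.1$, which is the comparison the proof relies on.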

An implication of Proposition 10, highlighted by the example used in the proof, is that an increase
in productivity can reduce total welfare because of its effect on coercion. In particular, increasing
productivity $x$ has two offsetting effects on social welfare. First, fixing $a$, increasing $x$ by $\Delta$ increases
expected output by $\Delta a$ (a first-order positive effect). Second, increasing $x$ increases $a$ and thus $g$. The
increase in $g$ has a first-order negative effect on worker welfare, but no first-order effect on producer
welfare by the Envelope Theorem. This reasoning also suggests that, as in the example in the proof,
an increase in productivity is more likely to reduce social welfare when $a$ is small.
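
The claim that the welfare effect depends on the level of effort can be illustrated numerically. The sketch below reuses the functional forms of the example in the proof ($c(a) = 1/(1-a)$, $\chi(g) = g^2/2$, $\bar{u} = 1$, $\eta = 2$) and estimates the slope of total welfare in $x$ by a central finite difference. The comparison point $x = 4.0$ is a hypothetical value chosen only to generate a larger effort level.

```python
def optimal_effort(x, u_bar=1.0, eta=2.0):
    """Bisection on the first-order condition x - u_bar - 2/(1-a)**2 + a/eta = 0."""
    def foc(a):
        return x - u_bar - 2.0 / (1.0 - a) ** 2 + a / eta

    if foc(0.0) <= 0.0:
        return 0.0
    lo, hi = 0.0, 0.999
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if foc(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def total_welfare(x, u_bar=1.0, eta=2.0):
    """Producer payoff plus worker payoff (u_bar - g, with g = a / eta)."""
    a = optimal_effort(x, u_bar, eta)
    producer = (x - u_bar) * a - 2.0 * a / (1.0 - a) + a * a / (2.0 * eta)
    worker = u_bar - a / eta
    return producer + worker


def welfare_slope(x, step=1e-4):
    """Central finite-difference estimate of d(total welfare)/dx."""
    return (total_welfare(x + step) - total_welfare(x - step)) / (2.0 * step)


if __name__ == "__main__":
    for x in (3.1, 4.0):
        print(f"x={x}: effort={optimal_effort(x):.3f} slope={welfare_slope(x):.3f}")
```

In this parameterization the slope is negative where effort is small (near $x = 3.1$) and positive where effort is larger, in line with the two offsetting effects described above.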

Finally, in partial equilibrium, even though coercion reduces the welfare of the producer and the
agent combined, it naturally increases the welfare of the producer (who chooses the coercion). In
general equilibrium, there is a stronger sense in which coercion may be "inefficient": an equilibrium
with coercion may be Pareto dominated by an equilibrium in which coercion is banned, in the sense
that both producers and agents are worse off with coercion. When this happens, potentially coercive
producers may come to favor abolishing slavery or limiting coercion. The following result provides
conditions under which banning coercion must be Pareto improving if $P(\cdot)$ declines sufficiently
steeply.^{37}

Proposition 11 Suppose that (i) $P(QL)$ satisfies $P(QL) > \bar{P}$ if $Q < \bar{Q}$ and $P(QL) < \underline{P}$ if $Q > \bar{Q}$;
(ii) if coercion is banned ($\eta = \infty$), $P = P(0)$, and $\bar{u} = \tilde{u}$, then for any $x$ all equilibrium contracts
$(a,g)(x, P(0), \tilde{u})$ have $a(x, P(0), \tilde{u}) < \bar{Q}$; (iii) if coercion is allowed ($\eta < \infty$), $P = P(L)$, and
$\bar{u} = \tilde{u}$, then for any $x$ all equilibrium contracts $(a,g)(x, P(L), \tilde{u})$ have $a(x, P(L), \tilde{u}) > \bar{Q}$. Then
there exists $P^* > 0$ such that, if $\underline{P} < P^*$, then any equilibrium with coercion banned Pareto dominates
any equilibrium with coercion allowed.

Proof. If coercion is banned, then $\bar{u} = \tilde{u}$ in any equilibrium, so for any $x$ the highest candidate
equilibrium contract is $(a,g)(x, P(0), \tilde{u})$. If $a(x, P(0), \tilde{u}) < \bar{Q}$ in each of these, then $a(x, P, \bar{u}) < \bar{Q}$
for all $x$ in any equilibrium, which implies that $P > \bar{P}$ in any equilibrium.

If coercion is allowed, then for any $x$ the lowest candidate equilibrium contract is $(a,g)(x, P(L), \tilde{u})$.
If $a(x, P(L), \tilde{u}) > \bar{Q}$ in each of these, then $a(x, P, \bar{u}) > \bar{Q}$ for all $x$ in any equilibrium, which implies
that $P < \underline{P}$ in any equilibrium.

If $P < \underline{P}$, then a producer's expected payoff is no more than $\underline{P}$. So as $\underline{P} \to 0$, a producer's
expected payoff in any equilibrium with coercion allowed goes to 0, while a producer's expected payoff
in any equilibrium with coercion banned remains constant, completing the proof. ■


^{37} A numerical example illustrating the conclusion of this proposition is available from the authors upon request.

5     Extensions

In this section, we discuss several extensions of the basic model presented so far. To reduce notation,
in the remainder of the text we focus on the partial equilibrium results.

5.1      Ex  Ante  Investments  and  Effort-  vs.   Skill-Intensive  Labor 

A natural conjecture would be that, in addition to the inefficiencies associated with coercion identified
above, coercive relationships discourage ex ante investments by workers. For example, one could use
the benchmark model of relationship-specific investments introduced in Grossman and Hart (1986),
equate "coercion" with the ex post bargaining power of the employer, and thus conclude that coercion
should discourage investments by the worker, while potentially encouraging those by the producer
(employer). Our model highlights that the effect of coercion on investments is more complex. In
particular, coercion will encourage agents to undertake investments that increase their outside options,
while at the same time giving them incentives to reduce (if they can) their productivity within
the relationship. This is because, as shown in Section 3, greater outside options increase the agent's
payoff more than one-for-one (by also reducing coercion), while greater productivity inside the coercive
relationship reduces the agent's payoff (by increasing coercion). Conversely, the producer will
undertake those investments that increase productivity more than the outside option of the agent.^{38}
One implication of this result is that the presence of coercion may encourage agents to invest in their
general human capital, though for very different reasons than the standard Becker approach would
suggest. Coercion also discourages them from investing in relationship-specific human capital; in fact,
it gives them incentives to sabotage their producer's productive assets.

We model these issues by adding an interim stage to our game between matching and the investment
in guns, $g$. At this stage, matched agents and producers make investment decisions, denoted by
$i$ and $I$, respectively. For simplicity, we will analyze the case in which such investment opportunities
are available only to one party (either the agent or the producer).^{39} Investments potentially affect
both productivity $x$ within the relationship and the worker's outside option $\bar{u}$, which we now write as
either $x(i)$ and $\bar{u}(i)$, or $x(I)$ and $\bar{u}(I)$, depending on which side makes the investment. Suppose that
investment $i$ has cost $\zeta(i)$ for the agent, while investment $I$ costs a producer $\tilde{\zeta}(I)$. We assume that
both cost functions are increasing and convex. We also further simplify the discussion throughout
by focusing on the generic set of parameters for which $u^l < 0$ in any equilibrium contract, so that
we can replace (7) by (8) and ensure differentiability of $g$ in parameters, and by imposing two additional
assumptions: (1) The equilibrium contract is unique for all $x$. (2) $\operatorname{sign}(Px'(i) - \bar{u}'(i))$ and
$\operatorname{sign}(Px'(I) - \bar{u}'(I))$ do not depend on $i$ and $I$, i.e., each investment always has a larger effect on
either $Px$ or $\bar{u}$, and thus either $\operatorname{sign}(Px'(i) - \bar{u}'(i)) < 0$ for all $i$ or $\operatorname{sign}(Px'(i) - \bar{u}'(i)) > 0$ for


^{38} Indeed, if the producer's ex ante investment decreases the agent's outside option and has no effect on productivity,
it is simply a relabeling of guns.

^{39} This implies that we are abstracting from indirect effects resulting from the interaction of investments by agents and
producers.

all $i$, and likewise for $\operatorname{sign}(Px'(I) - \bar{u}'(I))$. These last two assumptions are imposed only to facilitate
the exposition. If the first assumption is relaxed, one has to take care of the situations in which a
change in the investment of the agent can induce a change in equilibrium contract, though this does
not affect any of our comparative statics.^{40} Thus this assumption is without loss of much generality.
The second assumption enables us to clearly separate two different cases, which will have different
implications.

Let us first analyze the situation in which only agents have investment opportunities. As a benchmark,
note that if there is no coercion (i.e., $\eta = \infty$), a matched worker anticipates receiving expected
utility $\bar{u}(i) - \zeta(i)$ after choosing investment $i$, and therefore chooses $i$ to solve

$$\max_{i} \; \bar{u}(i) - \zeta(i). \tag{12}$$

Returning to the analysis in Section 3, it is clear that when there is the possibility of coercion (recall
that guns, $g$, is chosen after the agent's investment, $i$), the agent will receive utility $\bar{u}(i) - g(i) - \zeta(i)$,
and she therefore chooses $i$ to solve

$$\max_{i} \; \bar{u}(i) - g(i) - \zeta(i). \tag{13}$$

To characterize the solution to this program, we need to determine how the choice of guns by the
producer responds to changes in $i$, i.e., the form of the function $g(i)$. Equation (10) implies that
$g(i) = (\chi')^{-1}(a(i)/\eta)$; under the assumption that the equilibrium contract is unique, we can differentiate
this to obtain

$$g'(i) = \frac{a'(i)}{\eta\,\chi''(g(i))}. \tag{14}$$

Next, note that the producer's expected profit in an equilibrium contract may be written as

$$a\,(Px(i) - \bar{u}(i)) - a(1-a)\,c'(a) - a\,c(a) + ag - \eta\chi(g). \tag{15}$$

Therefore, $\operatorname{sign}(a'(i)) = \operatorname{sign}(Px'(i) - \bar{u}'(i))$, and thus

$$\operatorname{sign}(g'(i)) = \operatorname{sign}(Px'(i) - \bar{u}'(i)).$$

Combining this with (12) and (13) then immediately yields the following result (proof in the text):


^{40} In particular, if two (or more) equilibrium contracts existed, the producer could implicitly threaten the agent with
a switch from one equilibrium to the other. We do not find such threats plausible, since they rely on the indifference of
the producer between several options (such switches can never be strict best responses). Moreover, they can only arise
when there exist multiple equilibrium contracts for an interval of values of the agent's investments, which cannot be the
case in our framework.

In any case, it can be shown that even if there are multiple equilibrium contracts, all of the results in this subsection
continue to hold, though the analysis needs to be more involved, because we cannot use the differentiability of equilibrium
effort and coercion ($a(i)$ and $g(i)$). Instead, we would first have to show that an equilibrium cannot be at the point
of multiplicity if the producer used the multiplicity as an implicit threat, because a small change in $i$ would have a
noninfinitesimal effect on agent welfare, and then given this result, we could develop the same argument as in the text.

Proposition 12 Equilibrium investment by the agent under coercion, $i^C$, given by the solution to
(13), is greater than the noncoercion investment, $i^N$, given by the solution to (12), if and only if

$$\operatorname{sign}(Px'(i) - \bar{u}'(i)) < 0.$$

Proposition  12  implies  that  under  coercion  agents  will  underinvest  (relative  to  noncoercion)  in  tasks 
that  increase  their  within-relationship  productivity  relative  to  their  outside  option.  The  intuition  is 
straightforward:  when  the  agent  becomes  more  productive,  the  producer  chooses  a  contract  that 
induces  greater  effort,  which  also  comes  with  greater  coercion.  This  reduces  agent  welfare,  so,  all 
else  equal,  the  agent  would  like  to  avoid  this.  Counteracting  this  effect  is  the  fact  that  investment 
may  decrease  coercion  by  increasing  the  agent's  outside  option.  Thus,  if  we  focus  on  investments 
that  primarily  affect  the  outside  option  of  the  agent,  coercion  increases  investment.  Conversely,  when 
investments  are  most  relevant  for  within-relationship  productivity,  the  possibility  of  coercion  reduces 
investments. 
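
A small numerical sketch can make the two cases concrete. Every functional form here is a hypothetical choice made for illustration and is not taken from the paper: $c(a) = -\ln(1-a) - a$, $\chi(g) = g^2/2$, $\eta = 1$, $P = 1$, $\bar{u}(i) = 0.02 + 0.05i$ and $\zeta(i) = 0.1i^2$. With these forms the producer's reduced objective of type (15), after substituting $g = a/\eta$, becomes $a\Delta + a\ln(1-a) + a^2/(2\eta)$, where $\Delta = Px(i) - \bar{u}(i)$. The agent's investment is then found by grid search.

```python
import math

ETA = 1.0    # hypothetical cost-of-guns parameter
PRICE = 1.0  # hypothetical output price


def effort(delta, eta=ETA):
    """Effort maximizing a*delta + a*log(1-a) + a**2/(2*eta) on [0, 1)."""
    def foc(a):
        return delta + math.log(1.0 - a) - a / (1.0 - a) + a / eta

    if foc(0.0) <= 0.0:
        return 0.0
    lo, hi = 0.0, 1.0 - 1e-9
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if foc(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def outside_option(i):
    return 0.02 + 0.05 * i  # hypothetical u_bar(i)


def investment_cost(i):
    return 0.1 * i * i  # hypothetical zeta(i)


def best_investment(productivity, coercion):
    """Grid search over i in [0, 1] for the agent's preferred investment."""
    def payoff(i):
        g = 0.0
        if coercion:
            delta = PRICE * productivity(i) - outside_option(i)
            g = effort(delta) / ETA  # guns chosen by the producer: g = a / eta
        return outside_option(i) - g - investment_cost(i)

    grid = [k / 1000.0 for k in range(1001)]
    return max(grid, key=payoff)


# Benchmark without coercion, as in (12).
i_no_coercion = best_investment(lambda i: 0.19, coercion=False)
# Investment that only raises the outside option: P x'(i) - u_bar'(i) < 0.
i_outside = best_investment(lambda i: 0.19, coercion=True)
# Investment that mostly raises productivity: P x'(i) - u_bar'(i) > 0.
i_inside = best_investment(lambda i: 0.14 + 0.10 * i, coercion=True)

if __name__ == "__main__":
    print(i_no_coercion, i_outside, i_inside)
```

In this parameterization the agent invests more than the noncoercion benchmark when the investment only raises her outside option, and less when it mainly raises productivity inside the relationship.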

An interesting implication of Proposition 12 relates to the argument by Fenoaltea (1984), that
slavery is often observed in "effort-intensive" tasks, but not in "care-intensive" tasks (see also Engerman
and Sokoloff, 1998). Fenoaltea attributes this association to the psychological difficulty of using
punishments to motivate care. Our result provides an alternative explanation, under the assumption
that "care-intensive" tasks are those where relationship-specific investments by the worker are more
important; in this interpretation $a$ corresponds to "effort," while $i$ is associated with "care," and we
have in mind tasks where $Px'(\cdot) - \bar{u}'(\cdot) > 0$.^{41}

Another implication is that if the agent had the means to "sabotage" the productivity of her
employer, she would like to do so, because by reducing this productivity, she would reduce both the
amount of effort that she is induced to exert and the degree of coercion that she is subjected to.

Next, consider the situation where only producers undertake investments. Without coercion, a
producer who chooses $I$ and $a$ receives expected payoff

$$a\,(Px(I) - \bar{u}(I)) - a(1-a)\,c'(a) - a\,c(a) - \tilde{\zeta}(I), \tag{16}$$

while with coercion he receives expected payoff

$$a\,(Px(I) - \bar{u}(I)) - a(1-a)\,c'(a) - a\,c(a) + ag - \eta\chi(g) - \tilde{\zeta}(I). \tag{17}$$

Clearly, the producer will choose $I = 0$ if $Px'(I) - \bar{u}'(I) < 0$, regardless of whether we allow coercion:
this is a version of the standard result in human capital theory that producers never provide general
skills training. If, on the other hand, $Px'(I) - \bar{u}'(I) > 0$, then with the same arguments as in Section
3, it can be verified that (17) is supermodular in $(I, a, g, -\eta)$. Now the comparison between producer

^{41} This is consistent with Fenoaltea's discussion, which emphasizes the association between "care-intensive" and
skill-intensive tasks. For example, he notes that uncoerced galley crews were sometimes used because "the technically
superior rowing configuration did require skilled oarsmen" (1984, p. 642), and that "at least the skilled branches of
factory production" were care-intensive (p. 654).

investment under coercion and noncoercion can be carried out by noting that (16) is equivalent to (17)
as $\eta \to \infty$. Supermodularity of (17) then immediately gives the following result (proof in the text):

Proposition 13 Equilibrium investment by the producer under coercion, $I^C$, given as the solution
to (17), is greater than the noncoercion investment, $I^N$, given as the solution to (16), if and only if

$$\operatorname{sign}(Px'(I) - \bar{u}'(I)) > 0.$$

Proposition  13  has  a  similar  interpretation  to  Proposition  12:  investment  incentives  are  determined 
by  whether  they  affect  productivity  within  the  relationship  or  the  outside  option  by  more.  In  contrast 
to  the  agent,  the  producer  has  greater  incentives  to  invest  when  relationship-specific  productivity 
increases  by  more  than  the  outside  option  of  the  worker.  The  general  principle  here  is  related  to  the 
result  in  Acemoglu  and  Pischke  (1999)  that  employers  will  invest  in  general  human  capital  when  there 
is  "wage  distortion,"  so  that  these  investments  increase  worker  productivity  inside  the  relationship  by 
more  than  their  outside  wage. 
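
The producer's side of the comparison can be sketched in the same way. The functional forms are again hypothetical and chosen only for illustration: $c(a) = -\ln(1-a) - a$, $\chi(g) = g^2/2$, $P = 1$, $\bar{u}(I) = 0.02 + 0.05I$ and $\tilde{\zeta}(I) = 0.005I^2$. Under these forms the payoff in (17), net of the investment cost and after substituting $g = a/\eta$, is $a\Delta + a\ln(1-a) + a^2/(2\eta)$ with $\Delta = Px(I) - \bar{u}(I)$, and the noncoercion payoff in (16) is recovered by setting $\eta = \infty$.

```python
import math

PRICE = 1.0  # hypothetical output price


def best_effort(delta, eta):
    """Effort maximizing a*delta + a*log(1-a) + a**2/(2*eta) on [0, 1)."""
    def foc(a):
        return delta + math.log(1.0 - a) - a / (1.0 - a) + a / eta

    if foc(0.0) <= 0.0:
        return 0.0
    lo, hi = 0.0, 1.0 - 1e-9
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if foc(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


def producer_value(delta, eta):
    """Maximized payoff before investment costs; eta = inf means no coercion."""
    a = best_effort(delta, eta)
    return a * delta + a * math.log(1.0 - a) + a * a / (2.0 * eta)


def best_investment(productivity, outside_option, eta):
    """Grid search over I in [0, 1] for the producer's preferred investment."""
    def payoff(level):
        delta = PRICE * productivity(level) - outside_option(level)
        return producer_value(delta, eta) - 0.005 * level * level

    grid = [k / 1000.0 for k in range(1001)]
    return max(grid, key=payoff)


def u_bar(level):
    return 0.02 + 0.05 * level  # hypothetical outside option


def specific(level):
    return 0.14 + 0.10 * level  # P x'(I) - u_bar'(I) > 0


def general(level):
    return 0.19  # P x'(I) - u_bar'(I) < 0


I_coercion = best_investment(specific, u_bar, eta=1.0)
I_no_coercion = best_investment(specific, u_bar, eta=float("inf"))
I_general_coercion = best_investment(general, u_bar, eta=1.0)
I_general_no_coercion = best_investment(general, u_bar, eta=float("inf"))

if __name__ == "__main__":
    print(I_coercion, I_no_coercion, I_general_coercion, I_general_no_coercion)
```

In this sketch the producer invests more under coercion when the investment raises productivity by more than the outside option, and chooses zero investment in both regimes when it mainly raises the outside option.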

5.2      Collective Coercion

In many historical episodes, a group of producers exercises coercion collectively, using local or national
institutions, law enforcement agencies, or various groups of paramilitaries. In this subsection, we show
that our main comparative static results generalize to an environment in which coercion is undertaken
collectively by a finite number of (matched) producers. Formally, we assume that there are $N < \infty$
producers and agents matched together, and that the environment is identical to the partial equilibrium
model of Section 3 with the sole modification that the level of coercion $g$ faced by agent $i$ is

$$g = \frac{1}{N}\sum_{j=1}^{N} g_j,$$

where $g_j$ is the "contribution" of producer $j$ to the collective coercion technology. Clearly, this
specification makes coercion a privately provided public good for the producers.

The  following  proposition  can  be  established  using  the  same  arguments  leading  to  Propositions  1 
and  2,  and  we  omit  the  proof  for  brevity. 

Proposition 14 Suppose $Px > \bar{u} + c'(0)$. Then in any equilibrium contract each producer $i$ (of type
$x$) chooses $a_i > 0$ and $g_i > 0$, such that

$$(a_i, g_i) \in \arg\max_{(\tilde{a}_i, \tilde{g}_i) \in [0,1] \times \mathbb{R}_+} \; Px\tilde{a}_i - \tilde{a}_i\left[(1-\tilde{a}_i)\,c'(\tilde{a}_i) + c(\tilde{a}_i) + \bar{u} - \frac{\tilde{g}_i}{N} - \frac{1}{N}\sum_{j \neq i} g_j\right]_+ - (1-\tilde{a}_i)\left[-\tilde{a}_i\,c'(\tilde{a}_i) + c(\tilde{a}_i) + \bar{u} - \frac{\tilde{g}_i}{N} - \frac{1}{N}\sum_{j \neq i} g_j\right]_+ - \eta\chi(\tilde{g}_i), \tag{18}$$

with $w^l = p^h = 0$, $w^h = (1-a)\,c'(a) + c(a) + \bar{u} - g \geq 0$, and $p^l = a\,c'(a) - c(a) - \bar{u} + g \geq 0$, where
$g = \frac{1}{N}\sum_{j=1}^{N} g_j$.

Moreover, the set of equilibrium contracts $(a(x), g(x))$ of a producer of type $x$ forms a lattice,
with greatest and smallest elements $(a^+(x), g^+(x))$ and $(a^-(x), g^-(x))$. The extremal equilibrium
contracts $(a^+(x), g^+(x))$ and $(a^-(x), g^-(x))$ are increasing in $x$ and $P$ and decreasing in $\bar{u}$ and
$\eta$, and thus (extremal equilibrium) coercion levels, $g^+$ and $g^-$, are increasing in $P$ and $F(\cdot)$, and
decreasing in $\bar{u}$ and $\eta$.

Intuitively, the maximization problem (18) is identical to (7) with $\bar{u} - \frac{1}{N}\sum_{j \neq i} g_j$ replacing $\bar{u}$ and
$\chi(N g_i)$ replacing $\chi(g_i)$. This is the reason why all of our comparative statics (Proposition 2) continue
to hold with collective coercion (and are stated as part of Proposition 14). However, because coercion
is a public good for the producers, free-riding among them implies that less coercion will be used in
equilibrium than if the producers could solve their collective action problem.
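
The free-riding point can be illustrated with a symmetric example. The sketch below rests on several assumptions that are not in the text: the coercion faced by each agent is the average contribution $g = \frac{1}{N}\sum_j g_j$, the solution is interior so that the $[\cdot]_+$ terms in (18) can be dropped, and the functional forms are the hypothetical $c(a) = -\ln(1-a) - a$ and $\chi(g) = g^2/2$. Each producer's payoff is then $a(\Delta + g) + a\ln(1-a) - \eta g_i^2/2$ with $\Delta = Px - \bar{u}$. In a symmetric noncooperative outcome each producer internalizes only a share $1/N$ of the benefit of its own contribution, so $g = a/(N\eta)$, whereas a jointly optimal common contribution satisfies $g = a/\eta$.

```python
import math


def symmetric_outcome(delta, n_producers, eta=1.0, cooperative=False):
    """Return (effort, coercion level) in a symmetric outcome.

    The share of the marginal benefit of coercion that each producer
    internalizes is 1 under cooperation and 1/N under free-riding.
    """
    share = 1.0 if cooperative else 1.0 / n_producers

    def foc(a):
        # First-order condition in effort after substituting g = share * a / eta.
        return delta + share * a / eta + math.log(1.0 - a) - a / (1.0 - a)

    if foc(0.0) <= 0.0:
        return 0.0, 0.0
    lo, hi = 0.0, 1.0 - 1e-9
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if foc(mid) > 0.0:
            lo = mid
        else:
            hi = mid
    a = 0.5 * (lo + hi)
    return a, share * a / eta


if __name__ == "__main__":
    for n in (1, 2, 5):
        a_nc, g_nc = symmetric_outcome(0.12, n)
        a_co, g_co = symmetric_outcome(0.12, n, cooperative=True)
        print(f"N={n}: free-riding g={g_nc:.4f}, cooperative g={g_co:.4f}")
```

With a single producer the two outcomes coincide; with more producers the noncooperative level of coercion falls below the cooperative one in this parameterization.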

This  proposition  is  useful  in  highlighting  that  our  analysis  can  provide  the  basis  of  richer  theories 
of  coercion,  where  the  collective  nature  of  coercion  and  its  institutional  foundations  are  also  explicitly 
modeled,  and  our  comparative  static  results  are  likely  to  hold  in  these  more  general  environments. 

5.3      Limited  Commitment  to  Payments 

A major simplifying assumption of our analysis so far is that the producer can commit to any
output-contingent payment-punishment plan $(w^y, p^y)$. There is a natural tension raised by this assumption,
however. Contingent payments are often difficult to specify in contracts, which has motivated the
relational contracts literature (see, e.g., Levin, 2003). This issue is particularly problematic in our
context, since the producer has guns and the capacity to coerce, and can use these to renege on his
promises.^{42} In particular, a producer with more guns may face greater difficulty in committing to a
contract, creating a form of holdup problem.

In this subsection, we provide a brief, reduced-form analysis of this producer holdup problem. We
assume that the producer with guns equal to $g$ can renege on the payments promised to the agent
at a cost $h(g)$. For example, this may reflect the fact that the producer needs to threaten or apply
violence in order to renege on the promised payment, which will typically be costly. This possibility
introduces a natural holdup constraint, whereby if either $w^h$ or $w^l$ exceeds $h(g)$, then the producer will
exercise the option of using coercion in order to avoid payment. Since this will be perfectly anticipated
by the agent, no payment above $h(g)$ is credible or will be observed in equilibrium. Since equilibrium
contracts will still involve $w^l \leq w^h$, the relevant constraint is

$$w^h \leq h(g). \tag{H}$$


^{42} Interestingly, historical coercive labor "contracts" appear to have been "informal" or "relational," even when
coercion was legal (e.g., Fogel and Engerman, 1974). A full analysis of this issue may necessitate an infinitely repeated
game between the producer and the agent. Nevertheless, the static problem presented here is important, since we know
from the relational contracting literature (especially Levin, 2003), that optimal relational contracts between risk-neutral
parties often closely resemble optimal static contracts subject to payment constraints such as (H).

To reflect the fact that a producer with more guns finds it more difficult to commit to paying the
agent, we assume throughout this section that

$$h'(g) < 0 \text{ for all } g$$

(we also maintain the convenient assumption that $h(\cdot)$ is differentiable).

The rest of the relationship between matched producer and agent is the same as in Section 3, and
thus, with analogy to our analysis there, an equilibrium contract is a solution to the maximization
of (1) subject to (IC$_0$), (IR$_0$), and now (H). If this last constraint is slack, the solution in Section 3
applies.

Here we therefore focus on the case in which (H) is binding, and thus $w^h = h(g)$. In that case, we
can combine (H) with (IC$_0$) and (IR$_0$). A possible complication is that, in the presence of (H), it is no
longer the case that $w^l = 0$ necessarily (because the producer may need to pay, rather than punish,
following low output in order to satisfy the participation constraint of the agent without paying a wage
rate that would violate (H)). Nevertheless, a similar analysis to that in Section 3 can be developed.
First, note that the first-order approach still applies and thus, using the fact that $w^h = h(g) \geq 0$, we
can write a version of (IC$_1$) as

$$h(g) - (w^l - p^l) = c'(a).$$

This enables us to write (IR$_0$) as

$$h(g) + g \geq \bar{u} + (1-a)\,c'(a) + c(a), \tag{19}$$

which defines the set of feasible $(a, g)$, and $w^l = [h(g) - c'(a)]_+$. A useful observation is that if
$h'(\cdot) > -1$ (i.e., $h'(g) > -1$ for all $g$), then the left-hand side is increasing in $g$, and thus (19) forms
a lattice (or more appropriately a sublattice of $\mathbb{R}^2$). Conversely, if $h'(\cdot) < -1$, then the left-hand side
is decreasing in $g$, and (19) forms a lattice with the reverse order on $g$ (i.e., in $(a, -g)$).

Substituting $w^h = h(g)$ also into (1) and using the fact that $w^l = [h(g) - c'(a)]_+$, we obtain that
an equilibrium contract is now a solution to

$$\max_{(a,g) \in [0,1] \times \mathbb{R}_+} \; a\,(Px - h(g)) - (1-a)\,[h(g) - c'(a)]_+ - \eta\chi(g) \tag{20}$$

subject to (19).

It is straightforward to verify that the maximand in (20) is still supermodular in $(a, g)$, by our
assumption that $h'(\cdot) < 0$. When $h'(\cdot) > -1$, (19) forms a lattice, so equilibrium contracts are still
solutions to a supermodular maximization problem in $(a, g)$. This analysis therefore establishes the
following (proof in the text):^{43}

Proposition 15 Consider the problem with producer holdup and suppose that (H) binds at every
equilibrium contract and that $h'(\cdot) > -1$. Then the set of equilibrium contracts is a lattice with the
usual order on $\mathbb{R}^2$, and the weak version of the comparative static results of Proposition 2 apply.

^{43} Note that Proposition 15 only establishes weak comparative statics, while Proposition 2 established strict comparative
statics.

Intuitively, with holdup, an increase in $g$ creates two effects. The first is a direct effect on the
worker's participation constraint as in our baseline model of Section 3. The second is an indirect effect
working through the fact that higher $g$ now limits the amounts that the worker can be paid when
successful. When $h'(\cdot) > -1$, the direct effect outweighs the indirect effect, and thus increasing $g$
continues to relax (19); consequently, the comparative statics of Section 3 still apply.

The situation is different when $h'(\cdot) < -1$, because, as noted above, the constraint (19) now forms
a lattice with the reverse order on $g$, implying that higher $g$ will typically require a lower $a$. In this
case, the comparative static results of Section 3 no longer apply. To see this most clearly, suppose
that the equilibrium contract is unique and involves $w^l = h(g) - c'(a) > 0$. In this case, (20) can be
written as

$$\max_{(a,g) \in [0,1] \times \mathbb{R}_+} \; aPx - h(g) + (1-a)\,c'(a) - \eta\chi(g) \tag{21}$$

subject to (19), and small changes in parameters will lead to small changes in the solution to this
maximization problem (which still induces $w^l > 0$). In this case, the objective function is weakly
supermodular (or submodular as well as supermodular) and the constraint is a lattice in $(a, -g)$.
Therefore, in this case the opposite of the comparative statics of Section 3 will hold, in the sense that
coercion will be decreasing in $x$ and $P$, while effort will be increasing in the cost of guns, $\eta$. Intuitively,
because now $h'(\cdot) < -1$, the left-hand side of (19) is decreasing in $g$, and thus increasing coercion
tightens this constraint: an increase in $g$ necessitates significantly lower wages to satisfy (H) and this
more than offsets the direct effect of greater coercion on the agent's participation constraint.
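
The role of the threshold $h'(\cdot) = -1$ can be seen in a small sketch of the feasible set defined by (19). The specification is hypothetical: $h(g) = h_0 - kg$ with $h_0 = 0.5$, $\bar{u} = 0.2$, and $c(a) = -\ln(1-a) - a$, for which $(1-a)\,c'(a) + c(a) = -\ln(1-a)$, so that (19) reduces to $a \leq 1 - \exp(-(h(g) + g - \bar{u}))$. The slope parameter $k$ plays the role of $-h'(g)$.

```python
import math


def max_feasible_effort(g, k, h0=0.5, u_bar=0.2):
    """Largest effort a satisfying h(g) + g >= u_bar - log(1 - a), with h(g) = h0 - k*g.

    Returns 0.0 when the constraint leaves no room for positive effort.
    """
    slack = (h0 - k * g) + g - u_bar
    if slack <= 0.0:
        return 0.0
    return 1.0 - math.exp(-slack)


guns = [0.0, 0.05, 0.10, 0.15, 0.20]
# k = 0.5 corresponds to h'(g) > -1; k = 2.0 corresponds to h'(g) < -1.
weak_holdup = [max_feasible_effort(g, k=0.5) for g in guns]
strong_holdup = [max_feasible_effort(g, k=2.0) for g in guns]

if __name__ == "__main__":
    for g, lo_k, hi_k in zip(guns, weak_holdup, strong_holdup):
        print(f"g={g:.2f}: max effort {lo_k:.3f} (k=0.5), {hi_k:.3f} (k=2.0)")
```

When $k < 1$ additional guns enlarge the set of feasible effort levels, as in the baseline model; when $k > 1$ they shrink it, which is the sense in which higher $g$ requires lower $a$.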

5.4      Interim  Participation  Constraints 

Another  implicit  assumption  of  our  analysis  so  far  has  been  that  the  agent  can  run  away  immediately 
after  the  match  (i.e.,  after  the  "contract  offer"  of  the  producer),  but  not  subsequently.  An  alternative 
perspective  would  be  that  the  agent  could  also  attempt  to  run  away  after  observing  an  unsuccessful 
outcome,  i.e.,  precisely  when  she  will  receive  punishment  rather  than  reward.  This  additional  option  to 
run  away  would  introduce  one  more  "IR"  or  participation  constraint  for  the  agent,  which  we  refer  to  as 
the interim participation constraint. The presence of such an interim participation constraint introduces
a  potential  "useful"  role  of  coercion.  Intuitively,  the  agent  may  prefer  ex  ante  to  be  unable  to  run 
away  after  output  is  realized;  this  is  similar  to  the  logic  that  operates  in  models  such  as  Chwe's,  where 
coercion  does  not  affect  the  ex  ante  participation  constraint  but  does  enable  punishment.  Interestingly, 
however,  we  will  see  that,  under  fairly  weak  assumptions,  this  effect  is  dominated  by  the  negative 
impact  of  coercion  on  welfare. 

Formally, in this subsection we assume that the agent will run away after the realization of $y$ if
her (equilibrium-path) continuation utility is below $\bar{u} - \Phi(g)$, where $\bar{u}$ is the outside option of the
agent introduced above and $\Phi(g)$ is an increasing function of guns; the interpretation of this is that
the producer can inflict punishment $\Phi(g)$ if the agent runs away after the realization of $y$, just as
he can inflict punishment $g$ if the agent runs away before the realization of $y$.^{44} This consideration
therefore introduces an "interim IR constraint," (IIR), in addition to the "ex ante IR constraint,"
(IR$_0$), in Section 2. This implies that both $w^h - p^h$ and $w^l - p^l$ must be greater than $\bar{u} - \Phi(g)$, though
naturally only the constraint $w^l - p^l \geq \bar{u} - \Phi(g)$ can be binding in an equilibrium contract. Therefore,
the considerations discussed in this subsection introduce the additional constraint

$$w^l - p^l \geq \bar{u} - \Phi(g). \tag{IIR}$$

Now, by an argument similar to that in Section 3, an equilibrium contract is a solution to the
maximization of (1) subject to (IC$_0$), (IR$_0$), and now (IIR).

Suppose first that $\Phi(g)$ is sufficiently large for all $g$. It is then clear that (IIR) will always be
slack and the producer's problem is identical to that in Section 3, and Proposition 2 applies. In this
subsection, we thus suppose instead that $\Phi(g) \leq g$. This case turns out to be tractable because now
(IR$_0$) will be slack. This is shown by the following lemma:

Lemma 5 Consider the model with the interim participation constraint and suppose that $\Phi(g) \leq g$.
Then (IR$_0$) is slack and (IIR) binds.

Proof. The first-order approach still applies, so we can replace (IC$_0$) with (IC$_1$). By (IIR), $w^l - p^l \geq \bar{u} - \Phi(g)$. Substituting this into (IC$_1$) gives $w^h - p^h \geq c'(a) + \bar{u} - \Phi(g)$. Combining these inequalities
yields

\[
\begin{aligned}
a\left(w^h - p^h\right) + (1-a)\left(w^l - p^l\right) - c(a) &\geq \bar{u} - \Phi(g) + ac'(a) - c(a) \\
&\geq \bar{u} - \Phi(g) \\
&\geq \bar{u} - g,
\end{aligned}
\]

where the second inequality follows by convexity of $c(a)$ and the fact that $c(0) = 0$, and the third
follows by the assumption that $\Phi(g) \leq g$. This chain of inequalities implies (IR$_0$). Given that (IR$_0$)
is slack, the fact that increasing $p^l$ relaxes (IC$_1$) implies that (IIR) must bind. ■
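The chain of inequalities in this proof can be checked numerically. The sketch below is illustrative only: the cost function $c(a) = -a - \ln(1-a)$ and the linear technology $\Phi(g) = \phi g$ with $\phi \leq 1$ are assumptions chosen for convenience rather than forms taken from the paper, and the function name `chain` is hypothetical.

```python
import math

def c(a):
    # Assumed convex effort cost with c(0) = 0 (not from the paper).
    return -a - math.log(1.0 - a)

def c_prime(a):
    # Derivative of the assumed cost: c'(a) = a / (1 - a).
    return a / (1.0 - a)

def chain(a, g, phi, u_bar):
    """Return the four terms of the inequality chain when (IIR) binds.

    phi is the slope of the assumed technology Phi(g) = phi * g.
    """
    u_low = u_bar - phi * g        # w^l - p^l from a binding (IIR)
    u_high = u_low + c_prime(a)    # w^h - p^h from (IC_1)
    expected = a * u_high + (1.0 - a) * u_low - c(a)
    second = u_bar - phi * g + a * c_prime(a) - c(a)
    third = u_bar - phi * g
    fourth = u_bar - g
    return expected, second, third, fourth
```

Under these assumed forms the four returned terms should be weakly decreasing whenever $\phi \leq 1$, which is exactly the statement that (IR$_0$) is implied by (IIR) and (IC$_1$).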

Provided that $\Phi(g) \leq g$, Lemma 5 then allows us to substitute (IC$_1$) into (IIR), which implies
that equilibrium contracts are characterized by

\[ \max_{(a,g) \in [0,1] \times \mathbb{R}_+} aPx - a\left[c'(a) + \bar{u} - \Phi(g)\right]_+ - (1-a)\left[\bar{u} - \Phi(g)\right]_+ - \eta\chi(g). \tag{22} \]

(22) is supermodular in $(a, g, x, P, -\bar{u}, -\eta)$, so comparative statics go in the same direction as in our
baseline model, though they may hold with weak rather than strict inequalities:


For example, $g$ may be the pain that the producer can inflict on a worker if she runs away on the first day on the job,
while $\Phi(g)$ may be the pain that the producer can inflict on the worker once she has set up a home on the producer's
plantation.

If $\bar{u} - \Phi(g) > 0$, $a$ and $g$ are only weakly complementary in the producer's objective (i.e., the problem is both
supermodular and submodular). In this case, a change in one of $x$, $P$, $\bar{u}$, and $\eta$ will only affect the smallest and greatest
equilibrium contract values of either $a$ or $g$, leaving the smallest and greatest equilibrium contract values of the other
variable constant.
Proposition 16 If $\Phi(g) \leq g$ for all $g$, then the greatest and smallest equilibrium contracts
$(a^+(x), g^+(x))$ and $(a^-(x), g^-(x))$ are nondecreasing in $x$ and $P$ and nonincreasing in $\bar{u}$ and $\eta$.

The intuition for this is that, regardless of whether (IR$_0$) or (IIR) is binding, increasing $g$ reduces the amount that the producer must pay the agent after high output is realized, leading to
complementarity between effort and coercion as in Section 3.
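The comparative statics in $x$ can be illustrated with a small grid search over problem (22). This is a sketch under stated assumptions, not the paper's method: the functional forms $c'(a) = a/(1-a)$, $\chi(g) = g^2/2$, and $\Phi(g) = \phi g$, the default parameter values, and the names `objective` and `solve` are all hypothetical choices made for the example.

```python
def c_prime(a):
    # Marginal effort cost for the assumed c(a) = -a - ln(1 - a).
    return a / (1.0 - a)

def pos(z):
    # The [z]_+ operator.
    return max(z, 0.0)

def objective(a, g, x, P=1.0, u_bar=0.0, eta=1.0, phi=1.0):
    Phi = phi * g          # assumed interim punishment technology
    chi = 0.5 * g * g      # assumed cost of guns
    return (a * P * x
            - a * pos(c_prime(a) + u_bar - Phi)
            - (1.0 - a) * pos(u_bar - Phi)
            - eta * chi)

def solve(x, n=100, g_max=2.0, **params):
    """Grid search for a maximizer of (22); returns (a, g)."""
    best = None
    for i in range(n):              # a in {0, 1/n, ..., (n-1)/n}
        a = i / n
        for j in range(n + 1):      # g in {0, g_max/n, ..., g_max}
            g = g_max * j / n
            v = objective(a, g, x, **params)
            if best is None or v > best[0]:
                best = (v, a, g)
    return best[1], best[2]
```

For instance, comparing `solve(1.0)` with `solve(2.0)` shows whether effort and guns move together as $x$ rises; under these assumed forms both should increase, in line with Proposition 16.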

While the possibility that (IIR) rather than (IR$_0$) may be binding does not affect our comparative
static results, it does suggest that coercion may play a socially useful role. In particular, ex post
punishments may be useful in providing incentives to an agent who is subject to limited liability, and
(IIR) limits the use of such punishments; one may then conjecture that increasing $g$ may increase
social welfare if it relaxes (IIR). We next show that this conjecture is not correct. To see this, note
that if (IIR) is binding and coercion is not allowed (i.e., $\eta = \infty$), then the producer's problem becomes

\[ \max_{a \in [0,1]} aPx - ac'(a) - a\bar{u} - (1-a)[\bar{u}]_+, \]

and (utilitarian) social welfare is

\[ SW^N = \max_{a \in [0,1]} aPx - ac'(a) - a\bar{u} - (1-a)[\bar{u}]_+ + \bar{u}; \]

while with coercion the producer's problem is given by (22), and social welfare corresponding to an
equilibrium contract involving $(a,g)$ is

\[ SW^C = aPx - ac'(a) - a\bar{u} + a\Phi(g) - (1-a)\left[\bar{u} - \Phi(g)\right]_+ - \eta\chi(g) + \bar{u} - g. \]

If $\Phi(g) \leq g$ for all $g$, then it is clear that $SW^N \geq SW^C$, with strict inequality if $g > 0$. Thus, coercion
reduces social welfare if $\Phi(g) \leq g$. Formally, we have the following (proof in text):

Corollary 9 Suppose that $\Phi(g) \leq g$ for all $g$, and let $SW^C$ be social welfare corresponding to an
equilibrium contract with coercion. Then $SW^N \geq SW^C$, with strict inequality if $g > 0$.

The intuition for Corollary 9 follows by comparing (7) to (22). Both (7) and (22) are supermodular
in $(a, g)$, because of the term $ag$ in (7) and the term $a\Phi(g)$ (or, if $u^l > 0$, $\Phi(g)$) in (22). Our result in
Section 3 that coercion reduces social welfare exploits the fact that coercion enters into worker welfare
through the term $-g$, which is always larger in absolute value than $ag$. If $\Phi(g) \leq g$, then $-g$ is larger
in absolute value than $a\Phi(g)$ and $\Phi(g)$ as well, so the same argument as in Section 3 implies that
coercion reduces social welfare.

Finally, we note that we have not analyzed the case in which both $\Phi(g) > g$ for some $g$ and (IIR)
is binding. In such cases, Corollary 9 does not apply and thus does not rule out the possibility that
coercion may sometimes be good for social welfare. Nevertheless, the combination of Corollary 9,
which implies that when $\Phi(g) \leq g$, coercion reduces social welfare, and Corollary 5, which implies
that coercion also reduces social welfare when $\Phi(g)$ is sufficiently large to make (IIR) slack, suggests
that any example in which coercion increases social welfare will be somewhat special. This is the basis
of our claim that, under fairly weak assumptions, our general welfare result that coercion reduces
utilitarian social welfare appears robust to the presence of interim participation constraints.

5.5      Substitutability  Between  Effort  and  Productivity 

All of our results so far have been predicated on the complementarity between productivity and effort.
This is natural: greater effort increases the probability of success, and success is more valuable to
producers with greater productivity. Nevertheless, one can imagine more general setups where higher
productivity producers may have less need for effort. In this subsection, we briefly discuss how our
analysis can be adapted to this case, even though we find it less natural and suspect that it is also less
relevant empirically. To allow for this possibility, suppose that, instead of $ax$, the expected output of a
producer of type $x$ when the agent exerts effort $a$ is given by a twice continuously differentiable function
$y(a,x)$. Suppose also that the producer can only condition payments and punishments on a binary
signal, which equals High with probability $a$ and equals Low otherwise. With this formulation,
given $(a, g, w^h, w^l)$, the principal's expected payoff is

\[ Py(a,x) - aw^h - (1-a)w^l - \eta\chi(g). \]

It can now be verified that if $\partial^2 y(a,x)/\partial a \partial x > 0$ for all $a$ and $x$, then productivity and effort
continue to be complements, the problem of characterizing equilibrium contracts is supermodular in
$(a, g, x)$, and all of our results generalize. Suppose, instead, that $\partial^2 y(a,x)/\partial a \partial x < 0$ for all $a$ and
$x$, so that productivity and effort are substitutes. Equilibrium contracts are still characterized by the
maximization problem

\[ \max_{(a,g) \in [0,1] \times \mathbb{R}_+} Py(a,x) - a\left[(1-a)c'(a) + c(a) + \bar{u} - g\right]_+ - (1-a)\left[-ac'(a) + c(a) + \bar{u} - g\right]_+ - \eta\chi(g), \tag{23} \]

which is now supermodular in $(a, g, -x, P, -\bar{u}, -\eta)$. Thus, the only difference between (7) and (23),
for the purposes of comparative statics, is that $x$ is complementary with $(a, g, P, -\bar{u}, -\eta)$ in (7), but
is substitutable with $(a, g, P, -\bar{u}, -\eta)$ in (23). This discussion thus establishes the following (proof
omitted):

Proposition 17 Suppose that $\partial^2 y(a,x)/\partial a \partial x < 0$ for all $a$ and $x$. Then the set of equilibrium contracts for a producer of type $x$ forms a lattice, and the greatest and smallest equilibrium contracts
$(a^+(x), g^+(x))$ and $(a^-(x), g^-(x))$ are decreasing in $x$. The remaining comparative statics in Proposition 2 apply.
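As a brief check of why the direction of the comparative static in $x$ reverses (a sketch, assuming $P > 0$ and that $y$ is twice continuously differentiable as stated above): only the revenue term in (23) depends on $x$, so the relevant cross-partial is

```latex
\[
\frac{\partial^2}{\partial a\,\partial x}\Big[ P\,y(a,x) \Big]
  = P\,\frac{\partial^2 y(a,x)}{\partial a\,\partial x} < 0
\quad\Longleftrightarrow\quad
\frac{\partial^2}{\partial a\,\partial(-x)}\Big[ P\,y(a,x) \Big] > 0 .
\]
```

The objective therefore has increasing differences in $(a, -x)$. Since $g$ does not interact with $x$ directly and remains complementary with $a$, a higher $x$ lowers both effort and coercion.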


Implicitly, this formulation assumes that the principal cannot condition payments and punishments on the actual
realization of output. While this is not very appealing, it is the simplest way to introduce the possibility that effort and
productivity may be substitutes, which is our main focus in this subsection.
In this case workers are better off when matched with more productive producers, as these producers demand less effort and thus use less coercion. One implication is that, in contrast to the
analysis in subsection 4.6, slave trade, conditional on coercive labor relations, now increases agents'
welfare rather than reducing it. Having noted this possibility, we still emphasize that, to us, the case
with complements appears both more natural and empirically more plausible, though distinguishing
between the two cases is ultimately an empirical matter.

6     Concluding  Remarks 

Standard economic models assume that transactions in the labor market are "free." For most of human
history,  however,  the  bulk  of  labor  transactions  have  been  "coercive,"  in  the  sense  that  the  threat  of 
force  was  essential  in  convincing  workers  to  take  part  in  the  employment  relationship,  and  in  accepting 
the terms of employment. The small existing literature on slavery and forced labor does not model
what  we  view  as  the  fundamental  effect  of  coercion  on  labor  transactions — coercion  makes  workers 
accept  employment  relations  (terms  of  employment)  that  they  would  otherwise  reject.  This  paper 
provides  a  tractable  model  incorporating  this  feature  and  uses  it  to  provide  a  range  of  comparative 
static  results  useful  for  understanding  when  coercive  labor  relations  are  more  likely  to  arise  and  what 
their welfare implications are.

At the heart of our model is the principal-agent relationship between a potentially coercive producer
and an agent. Coercion, modeled as investment by the producer in "guns," affects the participation
constraint of the agent. We first analyzed this principal-agent problem and derived partial equilibrium
comparative statics, and then embedded it in a general equilibrium framework to study the relationship between labor scarcity/abundance and labor coercion. Both the partial and general equilibrium
analyses are made tractable because of the supermodularity of the maximization problem characterizing equilibrium contracts. This supermodularity is not only mathematically convenient, but also
economically  central:  coercion  covaries  positively  with  the  amount  of  effort  that  the  principal  wishes 
to  extract  from  the  agent.  Intuitively,  greater  effort  implies  that  the  principal  will  have  to  reward  the 
agent  more  frequently  because  success  is  more  likely.  But  this  also  implies  that  greater  coercion,  which 
reduces  these  rewards,  becomes  more  valuable  to  the  principal.  An  implication  of  this  observation 
is  that  agents  with  better  outside  options  will  be  coerced  less.  Another  is  that  agents  with  higher 
marginal  productivities  of  labor  in  the  coercive  sector  will  be  coerced  more. 

We  show  that,  consistent  with  the  famous  work  by  Fogel  and  Engerman  (1974),  coercion  increases 
effort.  However,  our  formulation  also  implies  that,  in  contrast  to  Fogel  and  Engerman's  interpretation, 
this does not imply that slavery (or coercion) is or may be "efficient." On the contrary, coercion always
reduces (utilitarian) social welfare, because the structure of the principal-agent model dictates that
coercion  hurts  the  agent  more  than  the  additional  effort  it  leads  to  helps  the  principal.  Our  model 
also  shows  that  coercion  changes  both  workers'  and  producers'  incentives  to  make  ex  ante  investments 
in  their  relationships,  and  points  out  a  new  channel  via  which  trading  coerced  workers  makes  them 
worse off.

A major question in the economics of coercion, both from a historical perspective and for understanding the continued prevalence of forced labor in many developing societies today, is the effect
of labor scarcity/abundance on coercion. In a seminal article on the Caribbean plantation complex,
Domar (1970) argued that labor scarcity encouraged coercion (by increasing the wage, which the employers could save by using slavery). In contrast, the most famous theories of the decline of feudalism
and  serfdom  link  these  events  to  the  sharp  reductions  in  population  experienced  in  Western  Europe 
starting  in  the  second  half  of  the  15th  century.  Acemoglu,  Johnson  and  Robinson  (2002),  in  turn, 
suggested  that  Europeans  were  more  likely  to  set  up  coercive  institutions  in  colonies  with  abundant 
labor,  because  setting  up  such  institutions  was  not  profitable  when  labor  was  scarce.  Our  general 
equilibrium  analysis  shows  why  these  diverse  perspectives  are  not  contradictory.  Labor  scarcity  first 
creates  a  labor  demand  effect:  it  increases  the  marginal  product  of  workers  in  the  coercive  sector,  and 
thus  encourages  employers  to  use  greater  coercion  and  extract  higher  effort  from  their  workers.  It 
also creates an outside option effect: it increases the outside option of the workers in the noncoercive
sector, and reduces coercion because employers demand lower effort and use less coercion with workers
that have greater outside options. Domar's argument for Caribbean slavery applies when the labor
demand effect is strong and the outside option effect is limited. In contrast, when the outside option
effect  is  important,  perhaps  as  in  Western  Europe  in  the  16th  century  because  of  the  presence  of  well- 
established  cities,  labor  scarcity  will  discourage  coercion.  Finally,  a  slight  modification  of  our  model, 
where  coercion  investments  are  made  before  matching,  also  leads  to  the  result  that  labor  scarcity 
may  discourage  coercion  through  an  economies  of  scale  effect  (because  coercing  workers  becomes  more 
costly  when  labor  is  scarce). 

We  view  this  paper  as  a  first  step  towards  a  systematic  analysis  of  coercion  in  the  labor  market  and 
its  implications  for  the  organization  of  production  and  economic  development.  Despite  the  historical 
and  current  importance  of  forced  labor  and  other  coercive  relations,  many  central  questions  in  the 
economics  of  coercion  have  not  previously  been  answered  or  even  posed.  Our  framework  provides  a 
first  set  of  answers  to  these  questions  and  can  serve  as  a  starting  point  for  different  directions  of  study. 
Theoretical  and  empirical  investigations  of  the  dynamics  of  coercion,  of  why  coercive  relationships 
persist  in  many  developing  countries  even  today,  of  the  effects  of  coercion  on  technology  choices  and 
organizational  decisions,  and  of  how  coercive  production  impacts  trade  are  important  areas  for  future 
research  (see,  for  example,  Naidu  and  Yuchtman,  2009,  for  a  recent  empirical  study  of  labor  coercion). 
A  particularly  fruitful  area  of  future  research  is  a  more  in-depth  analysis  of  the  politics  of  coercion. 
We  presumed  the  presence  of  an  institutional  environment  that  permitted  coercion  by  producers.  In 
many instances, coercion comes to an end, or is significantly curtailed, when political forces induce
a  change  in  the  institutional  environment.  Combining  our  microeconomic  model  of  coercion  with  a 
model  of  endogenous  institutions  would  be  one  way  of  making  progress  in  this  direction. 
Appendix A: Proofs of Propositions 1 and 2, and Lemma 3

Proof  of  Proposition  1 

Suppose first that an equilibrium contract involves $a > 0$. Recall that with two possible outcomes,
the first-order approach is valid (see Appendix B for more on this), so the incentive compatibility
constraint can be written in the form of (IC$_2$), which also exploits the fact that, given our assumption
$\lim_{a \to 1} c'(a) = \infty$, $a < 1$, and thus the first-order condition must hold as equality. Then the producer's
problem can be written as

\[ \max_{(a,g,w^h,w^l,p^h,p^l) \in [0,1] \times \mathbb{R}_+^5} a\left(Px - w^h\right) + (1-a)\left(-w^l\right) - \eta\chi(g) \]

subject to

\[ a\left(w^h - p^h\right) + (1-a)\left(w^l - p^l\right) - c(a) \geq \bar{u} - g \tag{IR$_0$} \]

and

\[ \left(w^h - p^h\right) - \left(w^l - p^l\right) = c'(a). \tag{IC$_2$} \]

It is suboptimal to set both $w^h > 0$ and $p^h > 0$, or $w^l > 0$ and $p^l > 0$, as reducing both $w^h$ ($w^l$)
and $p^h$ ($p^l$) by $\varepsilon$ would strictly increase profits without affecting the constraints, so $w^h = [u^h]_+$ and
$w^l = [u^l]_+$. The problem becomes

\[ \max_{(a,g,u^h,u^l) \in [0,1] \times \mathbb{R}_+ \times \mathbb{R}^2} a\left(Px - [u^h]_+\right) - (1-a)[u^l]_+ - \eta\chi(g) \tag{A-1} \]

subject to

\[ au^h + (1-a)u^l - c(a) \geq \bar{u} - g \tag{IR$_1$} \]

and

\[ u^h - u^l = c'(a). \tag{IC$_1$} \]

Substituting (IC$_1$) into (IR$_1$) yields

\[ u^h - (1-a)c'(a) - c(a) \geq \bar{u} - g. \tag{IR$_2$} \]

(IR$_2$) must bind, as otherwise decreasing $u^h$ would increase profit. Using (IC$_1$) and (IR$_2$) to substitute
$u^h$ and $u^l$ out of (A-1) now yields (7).
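The substitution that produces (7) can be spelled out in two lines. This is a worked restatement of the step above, using only the binding participation constraint (IR$_2$) and the first-order condition (IC$_1$), which gives $u^h - u^l = c'(a)$:

```latex
\[
\begin{aligned}
u^h &= (1-a)\,c'(a) + c(a) + \bar{u} - g
      && \text{(binding (IR$_2$))} \\
u^l &= u^h - c'(a) = -a\,c'(a) + c(a) + \bar{u} - g
      && \text{(from (IC$_1$))}
\end{aligned}
\]
```

Inserting these two expressions into $a\left(Px - [u^h]_+\right) - (1-a)[u^l]_+ - \eta\chi(g)$ gives an objective in $(a, g)$ alone, which is the form that appears in (7) and again in (A-3).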

It remains only to be shown that $g > 0$, $u^l \leq 0$, and $u^h \geq 0$, and finally that $a > 0$ in any
equilibrium contract (that $w^h = (1-a)c'(a) + c(a) + \bar{u} - g \geq 0$ and $p^l = ac'(a) - c(a) - \bar{u} + g \geq 0$
then following immediately from (IR$_2$) and (IC$_1$)). First, the result that $g > 0$ in any equilibrium
contract with $a > 0$ follows from (7), since it must be that $\chi'(g) \geq a/\eta$ in any equilibrium contract.
The result that $u^l \leq 0$ and $u^h \geq 0$ is established in the next lemma.

Lemma 6 In any equilibrium contract with $a > 0$, we have $u^l \leq 0$ and $u^h \geq 0$.
Proof. Note first that the Lagrangian for (A-1) subject to (IR$_2$) and (IC$_1$) is

\[ a\left(Px - [u^h]_+\right) - (1-a)[u^l]_+ - \eta\chi(g) + \lambda\left(au^h + (1-a)u^l - c(a) - (\bar{u} - g)\right) + \mu\left(u^h - u^l - c'(a)\right). \tag{A-2} \]

Now suppose, to obtain a contradiction, that $u^l > 0$. Then $[u^l]_+ = u^l$ and is differentiable.
Moreover, in this case $u^h > 0$, as $u^h > u^l$ (since $a > 0$), and thus $[u^h]_+ = u^h$ is also differentiable.
Moreover, clearly, $d[u^h]_+/du^h = d[u^l]_+/du^l = 1$. Then differentiating (A-2) with respect to $u^h$ and $u^l$, we obtain

\[ 1 = \lambda + \frac{\mu}{a} \tag{FOC$u^h$} \]

and

\[ 1 = \lambda - \frac{\mu}{1-a}. \tag{FOC$u^l$} \]

These first-order conditions always hold, as setting $u^h$ or $u^l$ to $\infty$ or $-\infty$ cannot be optimal if $a \in (0,1)$.
Combining (FOC$u^h$) and (FOC$u^l$) then implies that $\mu = 0$.

Now differentiating (A-2) with respect to $a$ and using (IC$_1$) yields

\[ Px - [u^h]_+ + [u^l]_+ = \mu c''(a). \tag{FOC$a$} \]

(FOC$a$) holds with equality by our assumptions that $a > 0$ and that $\lim_{a \to 1} c'(a) = \infty$. The fact that
$\mu = 0$ then implies that $[u^h]_+ - [u^l]_+ = u^h - u^l = Px$. Substituting $[u^h]_+ - [u^l]_+ = u^h - u^l = Px$
and $\mu = 0$ into the Lagrangian (A-2), we obtain

\[ -u^l - \eta\chi(g) + \lambda\left(u^l + Px - c(a) - (\bar{u} - g)\right). \]

This expression is maximized at the corner $a = 0$, contradicting our assumption that $a > 0$. This
completes the proof of the claim that $u^l \leq 0$.

Finally, to show $u^h \geq 0$, suppose, to obtain a contradiction, that $u^h < 0$. Then (A-2) is differentiable
with respect to $u^h$ at the optimum, and its derivative with respect to $u^h$ is

\[ a\lambda + \mu. \]

Since $a > 0$, this can equal 0 only if $\lambda = 0$ and $\mu = 0$. But then the maximum of (A-2) over $a \in [0,1]$
is attained at $a = 1$, which implies that the producer's profit equals $-\infty$, by (IR$_2$) and our assumption
that $\lim_{a \to 1} c'(a) = \infty$, which yields a contradiction. ■

It remains only to check that a solution to the producer's problem with $a > 0$ exists if $Px >
\bar{u} + c'(0)$. Consider the producer's problem as first choosing $a$ and then maximizing profit given
$a$. The producer's optimal profit given fixed $a$ is continuous in $a$. Therefore, if we show that the
producer's optimal profit given $a$ is increasing in $a$ for all sufficiently small $a > 0$, we will have shown
that $a = 0$ is not a solution.
The producer's problem, given $a > 0$, is maximizing (7) over $g \in \mathbb{R}_+$:

\[ \max_{g \in \mathbb{R}_+} a\left(Px - \left[(1-a)c'(a) + c(a) + \bar{u} - g\right]_+\right) - (1-a)\left[-ac'(a) + c(a) + \bar{u} - g\right]_+ - \eta\chi(g). \tag{A-3} \]

The right-derivative of (A-3) with respect to $a$ is

\[ Px - [u^h]_+ + [u^l]_+ - a(1-a)c''(a)\left(\frac{d[u^h]_+}{du^h} - \frac{d[u^l]_+}{du^l}\right), \tag{A-4} \]

where $d[u^h]_+/du^h$ and $d[u^l]_+/du^l$ denote right-derivatives. We have shown above that $u^h \geq 0$
and $u^h > u^l$ at an optimum with $a > 0$, and $d[u^h]_+/du^h - d[u^l]_+/du^l \leq 1$, so (A-4) is no less
than $Px - u^h - a(1-a)c''(a)$. By (IR$_2$), as $a$ converges to 0, $u^h$ converges to at most $\bar{u} + c'(0)$.
Therefore, provided that $Px > \bar{u} + c'(0)$, (A-4), and thus the derivative of (A-3) with respect to $a$, is
strictly positive for sufficiently small $a$. This establishes that $a > 0$, and completes the proof of the
Proposition. ■

Proof of Proposition 2

The maximization problem (7) is weakly supermodular in $(a, g, x, P, -\bar{u}, -\eta)$ everywhere. Therefore,
the results that the set of equilibrium contracts for a producer of type $x$ forms a lattice, and that the
greatest and smallest equilibrium contracts, $(a^+(x), g^+(x))$ and $(a^-(x), g^-(x))$, are nondecreasing
in $x$ and $P$ and nonincreasing in $\bar{u}$ and $\eta$ follow immediately (e.g., using Theorems 2.7.1 and 2.8.1 in
Topkis (1998)).

In the rest of the proof, we show that these comparative static results are strict in the sense that
whenever we have a change from $(x, P, \bar{u}, \eta)$ to $(x', P', \bar{u}', \eta')$, where $x' \geq x$, $P' \geq P$, $\bar{u}' \leq \bar{u}$, and
$\eta' \leq \eta$, with at least one strict inequality, we have "strictly increasing" instead of "nondecreasing" and
"strictly decreasing" instead of "nonincreasing". In the process of doing this, we will also establish the
"genericity" result claimed in Remark 3. These results are stated in the following lemma, the proof
of which completes the proof of the proposition.

Lemma 7 Consider the maximization problem (7), let $a(x, P, \bar{u}, \eta)$ and $g(x, P, \bar{u}, \eta)$ denote the least
[or greatest] solution to (7), then $a(x', P', \bar{u}', \eta') > a(x, P, \bar{u}, \eta)$ and $g(x', P', \bar{u}', \eta') > g(x, P, \bar{u}, \eta)$
for any $(x', P', \bar{u}', \eta')$ where $x' \geq x$, $P' \geq P$, $\bar{u}' \leq \bar{u}$, and $\eta' \leq \eta$, with at least one strict inequality.
Moreover, let $u^l(x, P, \bar{u}, \eta)$ be a solution at the parameter vector $(x, P, \bar{u}, \eta)$. If $u^l(x, P, \bar{u}, \eta) \geq 0$, then
$u^l(x', P', \bar{u}', \eta') < 0$.

Proof. For brevity, we will prove this lemma for a change in $x$, holding $P$, $\bar{u}$, and $\eta$ constant. The
argument for the other cases is analogous.

Consider a change from $x$ to $x' > x \geq \underline{x}$ (where recall that $\underline{x}$ is the lower support of the distribution
of $x$). The first part of Proposition 2, which has already been established, implies that $a(x', P', \bar{u}', \eta') \geq
a(x, P, \bar{u}, \eta)$ and $g(x', P', \bar{u}', \eta') \geq g(x, P, \bar{u}, \eta)$, which we shorten to $a(x') \geq a(x)$ and $g(x') \geq g(x)$.
It is then straightforward to verify that if $u^l(x) < 0$, then (7) can be written as

\[ (a, g) \in \arg\max_{(a,g) \in [0,1] \times \mathbb{R}_+} Pxa - a\left[(1-a)c'(a) + c(a) + \bar{u} - g\right]_+ - \eta\chi(g), \]

which is strictly supermodular in $x$ (or more generally in $(x, P, \bar{u}, \eta)$) in the neighborhood of $x$, and
moreover $g$ is given by (10). Then since $a > 0$, and thus $g > 0$, $a$ and $g$ cannot remain constant
following an increase in $x$, and since they cannot decline, it must be the case that an increase in $x$
strictly increases $a$ and $g$.

Therefore, any change from $x$ to $x' > x$ will give weak comparative static results only
if $u^l(\tilde{x}) = 0$ for all $\tilde{x} \in [x, x']$ (from Lemma 6, we have $u^l(\tilde{x}) \leq 0$, so $u^l(\tilde{x}) > 0$ is not possible). We
will next establish that $u^l(x) = 0$ (and thus weak (but not strict) supermodularity) is only possible
for infinitesimal changes at $x = \underline{x}$. Suppose, to obtain a contradiction, that this is the case, and in
particular, that for the least solutions in both cases, we have $a(x) = a(x')$ (the exact same argument
applies to greatest solutions). Then since $x > \underline{x}$, we must have $u^l(x) = 0$ in the neighborhood of $x$.
This implies, from (IC$_1$) and (IR$_2$), that

\[ u^l = -ac'(a) + c(a) + \bar{u} - g = 0. \tag{A-5} \]

Consider variations in $a$ and $g$ along (A-5) (i.e., holding $u^l = 0$). Since $a > 0$ and $c$ is differentiable,
this implies

\[ \frac{dg}{da} = -ac''(a) < 0. \]

Using this expression, we obtain that if $a(x) = a(x')$, then $g(x) = g(x')$. Next, since $a(x)$ and $g(x)$
are optimal, any such variation along (A-5) should not increase the value of (7). This is only possible
if

\[ Px - a(x)c''(a(x)) - (1 - a(x))c'(a(x)) - c(a(x)) - \bar{u} + g(x) + \eta\chi'(g(x))a(x)c''(a(x)) = 0, \tag{A-6} \]

where we have used the fact that $u^h > 0$ wherever $u^l = 0$, by (IC$_1$) and the fact that $a > 0$.
Now repeating the same argument for $x'$, and recalling that by hypothesis we have $u^l(x') = 0$ and
$a(x) = a(x')$, and thus $g(x) = g(x')$ (as just established), we also have

\[ Px' - a(x)c''(a(x)) - (1 - a(x))c'(a(x)) - c(a(x)) - \bar{u} + g(x) + \eta\chi'(g(x))a(x)c''(a(x)) = 0. \tag{A-7} \]

However, in view of the fact that $x' > x$, (A-7) cannot be true at the same time as (A-6). This implies
that (i) either $u^l(x) < 0$ or $u^l(x') < 0$, or (ii) $a(x') > a(x)$ ($a(x) > a(x')$ is ruled out by weak
supermodularity). But if $a(x') > a(x)$ and $u^l(x') = u^l(x)$, then (A-5) implies that $g(x') < g(x)$,
contradicting the fact that $g(x') \geq g(x)$. So in either case we have $u^l(x') < 0$ if $u^l(x) = 0$. Since this
is true for any $x' > x$, we can never have $u^l(x) = 0$ and $u^l(x') = 0$ for any $x' > x$, and thus we can
only have $u^l(x) = 0$ at $x = \underline{x}$. Moreover, as $u^l(x') < 0$, strict comparative statics apply at $x = x'$.
Since this is true for any pair of $x, x' \geq \underline{x}$, we conclude that (i) all comparative statics are strict (even
if $u^l(x) = 0$, we must have $u^l(x') < 0$, and thus a change from $x$ to $x'$ will strictly increase $a$ and $g$),
and (ii) for any pair of $x' > x$, if $u^l(x) = 0$, we must have $u^l(x') < 0$, and thus $u^l = 0$ is only possible
at one level of $x$, that is, at the lowest level $x = \underline{x}$. ■
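The strict comparative statics in Lemma 7 can be illustrated by solving (7) on a grid. The sketch below is not the paper's procedure: the cost $c(a) = -a - \ln(1-a)$, the gun cost $\chi(g) = g^2/2$, the default parameters, and the names `utilities`, `profit`, and `solve7` are all assumptions introduced for this example.

```python
import math

def c(a):
    # Assumed effort cost: c(0) = 0, convex, c'(a) -> infinity as a -> 1.
    return -a - math.log(1.0 - a)

def c_prime(a):
    return a / (1.0 - a)

def pos(z):
    # The [z]_+ operator.
    return max(z, 0.0)

def utilities(a, g, u_bar):
    """State utilities (u^h, u^l) implied by (IC_1) and a binding (IR_2)."""
    u_high = (1.0 - a) * c_prime(a) + c(a) + u_bar - g
    u_low = -a * c_prime(a) + c(a) + u_bar - g
    return u_high, u_low

def profit(a, g, x, P, u_bar, eta):
    # Objective (7) with the assumed gun cost chi(g) = g^2 / 2.
    u_high, u_low = utilities(a, g, u_bar)
    return (a * (P * x - pos(u_high))
            - (1.0 - a) * pos(u_low)
            - eta * 0.5 * g * g)

def solve7(x, P=1.0, u_bar=0.0, eta=1.0, n=100, g_max=2.0):
    """Grid search over a in [0, (n-1)/n] and g in [0, g_max]; returns (a, g)."""
    best_v, best_a, best_g = -math.inf, 0.0, 0.0
    for i in range(n):
        a = i / n
        for j in range(n + 1):
            g = g_max * j / n
            v = profit(a, g, x, P, u_bar, eta)
            if v > best_v:
                best_v, best_a, best_g = v, a, g
    return best_a, best_g
```

Under these assumed forms, raising $x$ from 1 to 2 should raise both the effort and the guns returned by `solve7`, while raising $\eta$ from 1 to 2 should lower both; `utilities` can be used to confirm that $u^l < 0$ at the computed contract, which is the case in which the lemma delivers strict inequalities.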
Proof of Lemma 3

Fix $t' > t$. That $F(x, t)$ and $F(x, t')$ have fixed points follows from Kakutani's Fixed Point Theorem.
Since $F(x, t)$ and $F(x, t')$ have closed graphs, as does the 45° line, the intersections of $F(x, t)$ and
$F(x, t')$ with 45° lines are closed, so $F(x, t)$ and $F(x, t')$ have smallest and greatest fixed points.

Let $y$ be the smallest fixed point of $F(x, t)$. For any $x < y$, every element of $F(x, t)$ is strictly
greater than $x$, as otherwise applying Kakutani's Fixed Point Theorem to the interval $[0, x]$ would
imply that $F(x, t)$ has a fixed point smaller than $x$, and thus smaller than $y$. By the definition of the
weak set order, this implies that every element of $F(x, t')$ is strictly greater than $x$, for any $x < y$. So
the smallest fixed point of $F(x, t')$ is no smaller than $y$.

Let $z$ be the greatest fixed point of $F(x, t)$. Then either $z \in F(z, t')$ or every element of $F(z, t')$
is strictly greater than $z$. In the latter case, applying Kakutani's Fixed Point Theorem to the interval
$[z, \bar{x}]$ implies that $F(x, t')$ has a fixed point greater than $z$. In either case, the greatest fixed point of
$F(x, t')$ is no smaller than $z$. ■

Appendix  B:  Multiple  Output  Levels 

In this Appendix, we allow for an arbitrary finite number of outputs. Let $f(y|a)$ be the probability
that output equals $y$ given effort $a$, and assume that $f(y|a)$ is twice-differentiable with respect to $a$.
Let $\bar{y}$ be the highest possible output and let $\underline{y}$ be the lowest possible output. We normalize the price
of output to 1 and do not consider producer heterogeneity. Equilibrium contracts are given by the
solution to the following problem:

\[ \max \sum_y \left(y - w(y)\right) f(y|a) - \eta\chi(g), \]

subject to

\[ \sum_y \left(w(y) - p(y)\right) f(y|a) - c(a) \geq \bar{u} - g, \tag{IR$_{B0}$} \]

and

\[ a \in \arg\max_{\tilde{a} \in [0,1]} \sum_y \left(w(y) - p(y)\right) f(y|\tilde{a}) - c(\tilde{a}). \tag{IC$_{B0}$} \]

We  continue  to  focus  on  the  case  where  equilibrium  contracts  involve  a  >  0.       Then,  from  Theorem 
1  in  Jewitt  (1988),  the  first-order  approach  to  this  problem  is  valid  provided  that: 

l-  11z<z  Yly<z  /  (yk)  is  nonincreasing  and  convex  in  a  for  each  z; 


""^It  is  straightforward  to  see  that  if  producers  differ  according  to  productivity  parameter  i  and  a  producer  with 
productivity  x  produces  output  xy  with  probability  /  {y\a),  all  supermodularity  results  from  Section  3  will  continue  to 
hold. 

■•'We  do  not  spell  out  the  assumptions  on  primatives  under  which  equilibrium  contracts  involve  a  >  0.  Assumption 
1  suffices  for  this  in  the  two-outcome  case,  and  similar  sufficient  conditions  can  be  developed  for  the  case  with  multiple 
output  levels,  but  this  is  orthogonal  to  our  focus  here. 
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2-  YlyKzyf  {y\°)  ^^  nondecreasing  and  concave  in  a  for  each  z\ 
3.    f(\a)  is  nondecreasing  and  concave  in  y  for  every  a. 

Jewitt provides an interpretation of these conditions. Note that Condition 3 implies the usual (and
relatively weak) monotone likelihood ratio property (MLRP). Therefore, MLRP holds throughout this
appendix. Jewitt argues that the remainder of the third condition is the most restrictive; in addition
to MLRP, "[it requires that] variations in output at higher levels are relatively less useful in providing
'information' on the agent's effort than they are at lower levels of output" (Jewitt, 1988, p. 1181).
Note that Jewitt's condition on utility functions is not needed when the agent is risk-neutral.

Given these three conditions, and simplifying the exposition by imposing $u^l < 0$, we can apply
the first-order approach and (writing $u(y)$ for $w(y) - p(y)$) rewrite the problem as

$$\max_{(a, g, u(\cdot))} \; \sum_{y} \big(y - [u(y)]_+\big) f(y|a) - \eta \chi(g)$$

subject to

$$\sum_{y} u(y) f(y|a) - c(a) \ge \bar{u} - g \tag{IR$_{B1}$}$$

and

$$\sum_{y} u(y) f_a(y|a) = c'(a). \tag{IC$_{B1}$}$$

The associated Lagrangian is

$$\sum_{y} \big(y - [u(y)]_+\big) f(y|a) - \eta \chi(g) + \lambda \Big( \sum_{y} u(y) f(y|a) - c(a) - (\bar{u} - g) \Big) + \mu \Big( \sum_{y} u(y) f_a(y|a) - c'(a) \Big).$$

Differentiating the Lagrangian with respect to $u(y)$ and rearranging implies that

$$1 = \lambda + \mu \frac{f_a(y|a)}{f(y|a)} \quad \text{if } u(y) > 0, \tag{B-1}$$

$$0 = \lambda + \mu \frac{f_a(y|a)}{f(y|a)} \quad \text{if } u(y) < 0, \text{ and} \tag{B-2}$$

$$0 \le \lambda + \mu \frac{f_a(y|a)}{f(y|a)} \le 1 \quad \text{if } u(y) = 0. \tag{B-3}$$

By MLRP, (B-1), (B-2), and (B-3) imply that $u(y) = 0$ for all $y \notin \{\underline{y}, \bar{y}\}$, $u(\underline{y}) \le 0$,
and $u(\bar{y}) \ge 0$. This is a standard "bang-bang" result given MLRP and the agent's risk-neutrality. To
simplify notation, let $u^h = u(\bar{y})$ and let $u^l = u(\underline{y})$, paralleling the notation in subsection 3.1.


Footnote: Under suitable restrictions on $f(y|a)$, which we neglect here for ease of exposition, $u^l < 0$ in any equilibrium contract
for "generic" parameter values, as in the two-outcome case considered in the text.
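The producer's maximization problem can now be written as

$$\max_{(a, g, u^h, u^l) \in [0,1] \times \mathbb{R}_+ \times \mathbb{R}^2} \; \sum_{y} y f(y|a) - f(\bar{y}|a) [u^h]_+ - \eta \chi(g)$$

subject to

$$f(\bar{y}|a) u^h + f(\underline{y}|a) u^l - c(a) \ge \bar{u} - g \tag{IR$_{B2}$}$$

and

$$f_a(\bar{y}|a) u^h + f_a(\underline{y}|a) u^l = c'(a). \tag{IC$_{B2}$}$$

(IC$_{B2}$) can be rewritten as

$$u^l = \frac{c'(a) - f_a(\bar{y}|a) u^h}{f_a(\underline{y}|a)}. \tag{B-4}$$

Substituting (B-4) into (IR$_{B2}$) and using the fact that (IR$_{B2}$) binds at the solution gives

$$f(\bar{y}|a) u^h + f(\underline{y}|a) \frac{c'(a) - f_a(\bar{y}|a) u^h}{f_a(\underline{y}|a)} - c(a) = \bar{u} - g,$$

which may be rewritten as

$$u^h = \frac{c(a) - \frac{f(\underline{y}|a)}{f_a(\underline{y}|a)} c'(a) + \bar{u} - g}{f(\bar{y}|a) - \frac{f(\underline{y}|a)}{f_a(\underline{y}|a)} f_a(\bar{y}|a)}. \tag{B-5}$$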
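
As a consistency check (a sketch added for illustration, not part of the original derivation), the expression for $u^h$ obtained from the binding participation constraint and (B-4) can be specialized to the two-outcome case of Section 3.1, where $f(\bar{y}|a) = a$ and $f(\underline{y}|a) = 1 - a$. The result coincides with the bracketed payment term in the two-outcome producer objective restated in Appendix C.

```latex
% Two-outcome specialization: f(\bar y|a) = a and f(\underline y|a) = 1 - a,
% so that f_a(\bar y|a) = 1 and f_a(\underline y|a) = -1.
\begin{align*}
u^h &= \frac{c(a) - \dfrac{f(\underline{y}|a)}{f_a(\underline{y}|a)}\, c'(a) + \bar{u} - g}
            {f(\bar{y}|a) - \dfrac{f(\underline{y}|a)}{f_a(\underline{y}|a)}\, f_a(\bar{y}|a)} \\[4pt]
    &= \frac{c(a) + (1-a)\, c'(a) + \bar{u} - g}{a + (1-a)} \\[4pt]
    &= (1-a)\, c'(a) + c(a) + \bar{u} - g .
\end{align*}
```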


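Finally, substituting (B-5) into the principal's objective gives

$$\max_{(a, g) \in [0,1] \times \mathbb{R}_+} \; \sum_{y} y f(y|a) - f(\bar{y}|a) \left[ \frac{c(a) - \frac{f(\underline{y}|a)}{f_a(\underline{y}|a)} c'(a) + \bar{u} - g}{f(\bar{y}|a) - \frac{f(\underline{y}|a)}{f_a(\underline{y}|a)} f_a(\bar{y}|a)} \right]_+ - \eta \chi(g). \tag{B-6}$$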


To establish the supermodularity of (B-6) in $(a, g, -\bar{u}, -\eta)$, it suffices to show that the cross-partial
of the right-hand side of (B-6) with respect to $a$ and $g$ is always nonnegative. This is immediate if
$[u^h]_+ = 0$. If $[u^h]_+ > 0$, the right-hand side of (B-6) may be rewritten as

$$\sum_{y} y f(y|a) - \frac{\frac{f(\bar{y}|a)}{f_a(\bar{y}|a)}}{\frac{f(\bar{y}|a)}{f_a(\bar{y}|a)} - \frac{f(\underline{y}|a)}{f_a(\underline{y}|a)}} \left( c(a) - \frac{f(\underline{y}|a)}{f_a(\underline{y}|a)} c'(a) + \bar{u} - g \right) - \eta \chi(g), \tag{B-7}$$

where we have divided the numerator and denominator of the middle term by $f_a(\bar{y}|a)$. The
cross-partial of (B-7) with respect to $a$ and $g$ is nonnegative if and only if the derivative of
$\left( \frac{f_a(\bar{y}|a)}{f(\bar{y}|a)} \right) \Big/ \left( \frac{f_a(\underline{y}|a)}{f(\underline{y}|a)} \right)$ with respect to $a$ is nonnegative. We have thus established the following result:


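Proposition 18 Suppose that $a > 0$, Conditions 1-3 hold, and $\left( \frac{f_a(\bar{y}|a)}{f(\bar{y}|a)} \right) \Big/ \left( \frac{f_a(\underline{y}|a)}{f(\underline{y}|a)} \right)$ is increasing
in $a$. Then equilibrium contracts are given by (B-6), and (B-6) is supermodular in $(a, g, -\bar{u}, -\eta)$.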
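
The supermodularity claim can be illustrated numerically in the two-outcome special case $f(\bar{y}|a) = a$. The sketch below checks increasing differences in $(a, g)$, the discrete analog of a nonnegative cross-partial, on a grid. The functional forms $c(a) = a^2/2$ and $\chi(g) = g^2/2$ and all parameter values are hypothetical choices made only for illustration; in particular, this $c$ does not satisfy the limit condition on $c'$ used elsewhere in the paper.

```python
def effort_cost(a):
    """Hypothetical convex effort cost c(a) = a^2 / 2 (an assumption)."""
    return 0.5 * a * a


def marginal_effort_cost(a):
    """Derivative c'(a) of the hypothetical effort cost."""
    return a


def coercion_cost(g):
    """Hypothetical convex cost of guns chi(g) = g^2 / 2 (an assumption)."""
    return 0.5 * g * g


def producer_payoff(a, g, y_high=2.0, y_low=0.0, u_bar=0.5, eta=1.0):
    """Two-outcome version of the reduced objective, with f(y_high|a) = a."""
    expected_output = a * y_high + (1.0 - a) * y_low
    # Payment after high output implied by binding IR and IC constraints.
    u_high = (1.0 - a) * marginal_effort_cost(a) + effort_cost(a) + u_bar - g
    return expected_output - a * max(u_high, 0.0) - eta * coercion_cost(g)


def has_increasing_differences(payoff, a_grid, g_grid, tol=1e-12):
    """Check that the gain from raising a is no smaller at higher g."""
    for i, a_lo in enumerate(a_grid):
        for a_hi in a_grid[i + 1:]:
            for j, g_lo in enumerate(g_grid):
                for g_hi in g_grid[j + 1:]:
                    gain_at_high_g = payoff(a_hi, g_hi) - payoff(a_lo, g_hi)
                    gain_at_low_g = payoff(a_hi, g_lo) - payoff(a_lo, g_lo)
                    if gain_at_high_g < gain_at_low_g - tol:
                        return False
    return True


a_grid = [i / 20 for i in range(1, 20)]   # effort levels in (0, 1)
g_grid = [j / 10 for j in range(0, 16)]   # coercion levels in [0, 1.5]
print(has_increasing_differences(producer_payoff, a_grid, g_grid))
```

The grid for $g$ deliberately extends past the point where the payment term hits zero, so the check also covers the kink created by $[\cdot]_+$.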
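The condition that $\left( \frac{f_a(\bar{y}|a)}{f(\bar{y}|a)} \right) \Big/ \left( \frac{f_a(\underline{y}|a)}{f(\underline{y}|a)} \right)$ is increasing in $a$ is neither particularly restrictive nor
particularly easy to interpret. To see why it is sufficient for supermodularity of equilibrium contracts,
note that, by (B-5) (which is determined by (IR$_{B2}$) and (IC$_{B2}$)), increasing $g$ by $\Delta$ allows the principal
to reduce $w^h$ (i.e., payment after the highest output level) by $\Delta \Big/ \left( f(\bar{y}|a) - \frac{f_a(\bar{y}|a)}{f_a(\underline{y}|a)} f(\underline{y}|a) \right)$. Since the principal
pays $w^h$ with probability $f(\bar{y}|a)$, increasing $g$ by $\Delta$ benefits the principal by

$$\Delta \, \frac{f(\bar{y}|a)}{f(\bar{y}|a) - \frac{f_a(\bar{y}|a)}{f_a(\underline{y}|a)} f(\underline{y}|a)} = \Delta \left( 1 - \frac{f_a(\bar{y}|a) / f(\bar{y}|a)}{f_a(\underline{y}|a) / f(\underline{y}|a)} \right)^{-1},$$

which is increasing in $a$ if $\left( \frac{f_a(\bar{y}|a)}{f(\bar{y}|a)} \right) \Big/ \left( \frac{f_a(\underline{y}|a)}{f(\underline{y}|a)} \right)$ is. It is instructive to compare this with the following slight
generalization of the two-outcome case discussed in Section 3.1: Suppose there are only two outcomes
$\underline{y} < \bar{y}$, but that $f(\bar{y}|a)$ need not equal $a$. Then, $f(\bar{y}|a) - \frac{f_a(\bar{y}|a)}{f_a(\underline{y}|a)} f(\underline{y}|a) = f(\bar{y}|a) + f(\underline{y}|a) = 1$ (as
$f_a(\bar{y}|a) = -f_a(\underline{y}|a)$), so $f(\bar{y}|a) \Big/ \left( f(\bar{y}|a) - \frac{f_a(\bar{y}|a)}{f_a(\underline{y}|a)} f(\underline{y}|a) \right) = f(\bar{y}|a)$. Therefore, increasing $g$ by $\Delta$ benefits the principal
by $\Delta f(\bar{y}|a)$, which is increasing in $a$ under MLRP. In particular, we see that $\left( \frac{f_a(\bar{y}|a)}{f(\bar{y}|a)} \right) \Big/ \left( \frac{f_a(\underline{y}|a)}{f(\underline{y}|a)} \right)$ is
increasing in $a$ if $f(\bar{y}|a) + f(\underline{y}|a)$ does not depend on $a$, regardless of the number of outcomes.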
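
The last remark can be checked on a small example. The sketch below uses a hypothetical three-outcome distribution in which the middle outcome has a fixed probability, so that $f(\bar{y}|a) + f(\underline{y}|a)$ does not depend on $a$; the specific probabilities are assumptions chosen for illustration. It verifies on a grid that both the likelihood-ratio quotient and the principal's marginal benefit from coercion are increasing in $a$. It does not attempt to verify Conditions 1-3.

```python
P_MIDDLE = 0.2  # hypothetical probability of the middle output, independent of effort


def f_high(a):
    """Hypothetical probability of the highest output given effort a."""
    return 0.1 + 0.6 * a


def f_low(a):
    """Hypothetical probability of the lowest output given effort a."""
    return 0.7 - 0.6 * a


def fa_high(a):
    """Derivative of f_high with respect to a."""
    return 0.6


def fa_low(a):
    """Derivative of f_low with respect to a."""
    return -0.6


def likelihood_ratio_quotient(a):
    """(f_a/f at the highest output) divided by (f_a/f at the lowest output)."""
    return (fa_high(a) / f_high(a)) / (fa_low(a) / f_low(a))


def marginal_benefit_of_coercion(a):
    """Benefit to the principal per unit increase in g, when the payment is positive."""
    return f_high(a) / (f_high(a) - (fa_high(a) / fa_low(a)) * f_low(a))


def is_strictly_increasing(values):
    return all(x < y for x, y in zip(values, values[1:]))


effort_grid = [i / 10 for i in range(1, 10)]
quotients = [likelihood_ratio_quotient(a) for a in effort_grid]
benefits = [marginal_benefit_of_coercion(a) for a in effort_grid]
print(is_strictly_increasing(quotients), is_strictly_increasing(benefits))
```

In this example the marginal benefit reduces to $f(\bar{y}|a)$ divided by the constant $f(\bar{y}|a) + f(\underline{y}|a)$, which mirrors the two-outcome case up to scale.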

Appendix  C:  Existence  of  Equilibrium  in  the  General  Case 

In  this  appendix,  we  use  the  distributional  approach  to  equilibrium  in  games  with  a  continuum  of 
players  to  establish  existence  of  "mixed-strategy  equilibria"  in  the  general  model  of  Section  4,  and 
show  that  analogs  of  Propositions  6  and  7  apply  to  extremal  mixed-strategy  equilibria.  The  analysis 
also shows that our approach in the text carries over to the case where $F(\cdot)$ has mass points. We
establish existence of a mixed-strategy equilibrium using Theorem 1 of Mas-Colell (1984), but applying
this  result  requires  us  to  recast  the  maximization  problem  characterizing  equilibrium  contracts  in  an 
equivalent  form. 

In our original formulation of the producer's problem, we viewed a producer with productivity $x$ as
choosing $(a, g) \in \mathbb{R}^2$ to maximize $aPx - a\,[(1 - a) c'(a) + c(a) + \bar{u} - g]_+ - \eta \chi(g)$, and we also assumed
that the distribution of productivity $F$ had a continuous density $f$. Both of these assumptions were
adopted to simplify the notation in the text and can be relaxed. This appendix outlines how this can
be done.

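First, note that since $\lim_{g \to \infty} \chi(g) = \infty$ and $a < 1$, there exists a positive number $\bar{g}$ such that
setting $g > \bar{g}$ is dominated for any producer. We now specify that a producer with productivity $x$
chooses $(q, g) \in [0, \bar{x}] \times [0, \bar{g}]$ to maximize

$$Pq - \frac{q}{x} \left[ \left(1 - \frac{q}{x}\right) c'\!\left(\frac{q}{x}\right) + c\!\left(\frac{q}{x}\right) + \bar{u} - g \right]_+ - \left(1 - \frac{q}{x}\right) \left[ -\frac{q}{x} c'\!\left(\frac{q}{x}\right) + c\!\left(\frac{q}{x}\right) + \bar{u} - g \right]_+ - \eta \chi(g), \tag{C-1}$$

where we have rewritten $a$ as $q/x$. By our assumption that $\lim_{a \to 1} c'(a) = \infty$, any solution to the
problem of a producer with productivity $x$ satisfies $q < x$. It is also clear that (C-1) is continuous in
$(q, g)$.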
The point of the above reformulation is to give each producer the same action set, and our assumption
that $\lim_{a \to 1} c'(a) = \infty$ ensures that every equilibrium contract in the reformulated game is
feasible in the original game. Given this, existence of a mixed-strategy equilibrium (or "Cournot-Nash
equilibrium distribution" in the language of Mas-Colell) is immediate from Theorem 1 of Mas-Colell.
Formally, a mixed-strategy equilibrium is now a measure $\tau$ over $\pi(x) \times ([0, \bar{x}] \times [0, \bar{g}])$ such that the
marginals $\tau_{\pi}$ and $\tau_{(q,g)}$ of $\tau$ on $\pi(x)$ and $[0, \bar{x}] \times [0, \bar{g}]$, respectively, satisfy

1. $\tau_{\pi} = f$,

2. $\tau(\{\pi(x), q, g\} : q < x) = 1$, and

3. $\tau(\{\pi(x), q, g\} : \pi(x, q, g, \tau_{(q,g)}) \ge \pi(x, \tilde{q}, \tilde{g}, \tau_{(q,g)})$ for all $(\tilde{q}, \tilde{g}) \in [0, \bar{x}] \times [0, \bar{g}]) = 1$,

where $\pi(x, q, g, \tau_{(q,g)})$ is the payoff to a producer with productivity $x$ who chooses $(q, g)$ when other
producers' actions are given by the distribution $\tau_{(q,g)}$. We also write $\pi(x, q, g, Q, G)$ for the payoff to a
producer with productivity $x$ who chooses $(q, g)$ when facing aggregates $Q$ and $G$, as the distribution
$\tau_{(q,g)}$ affects each producer's payoff only through $Q$ and $G$.

Proposition  19  A  mixed-strategy  equilibrium  exists. 

Proof. The proof follows closely the proof of Mas-Colell's Theorem 1, with the addition that each
distribution consistent with producers' playing best responses satisfies point 2 above (since $q < x$ in
any equilibrium contract of a producer with productivity $x$, by our assumption that $\lim_{a \to 1} c'(a) = \infty$),
so any fixed point must satisfy point 2 as well. ■

Finally, we show that analogs of Propositions 6 and 7 apply to mixed-strategy equilibria of the
kind considered above. The argument is identical to that leading to Propositions 6 and 7, adapted
to the distributional definition of equilibria; we therefore omit some details. Given a joint density
$\tau_{(q,g)}$ over $[0, \bar{x}] \times [0, \bar{g}]$, let $\tau_q$ and $\tau_g$ be the corresponding marginal densities over $[0, \bar{x}]$ and $[0, \bar{g}]$,
respectively. Let

$$Q(\tau_{(q,g)}) = \int_0^{\bar{x}} q \, \tau_q(q) \, dq,$$

and let

$$G(\tau_{(q,g)}) = \int_0^{\bar{g}} g \, \tau_g(g) \, dg,$$

be the corresponding levels of the aggregates. Let

$$\phi(Q, \gamma, L, \bar{u}, \eta) = \left\{ Q' : \begin{array}{l} \text{there exists a density } \tau \text{ on } \pi(x) \times ([0, \bar{x}] \times [0, \bar{g}]) \text{ such that} \\ \tau\big(\{\pi(x), q, g\} : \pi(x, q, g, Q, G(\tau_{(q,g)})) \ge \pi(x, \tilde{q}, \tilde{g}, Q, G(\tau_{(q,g)})) \text{ for all } (\tilde{q}, \tilde{g})\big) = 1 \\ \text{and } Q' = Q(\tau_{(q,g)}) \end{array} \right\}.$$

That is, $\phi$ maps $Q$ and parameter values to those $Q'$ that are mixed-strategy equilibrium levels of
output in the modified game where price is fixed at $P(LQ)$. It is clear that the set of fixed points of
$\phi$ equals the set of mixed-strategy equilibrium values of $Q$. We may now establish the following:
Lemma 8 The greatest and smallest elements of $\phi(Q, \gamma, L, \bar{u}, \eta)$ are increasing in $F(x)$ and $\gamma$, and
decreasing in $Q$, $L$, $\bar{u}$, and $\eta$.

Proof.  Analogous  to  proof  of  Lemma  4.    ■ 

Proposition 20 The greatest and smallest mixed-strategy equilibrium values of $Q$ are increasing in
$F(x)$ and $\gamma$, and decreasing in $L$, $\bar{u}$, and $\eta$.

Proof.  Analogous  to  proof  of  Proposition  6.    ■ 

Repeating  the  above  argument  interchanging  the  roles  of  Q  and  G  gives  the  following: 

Proposition 21 The greatest and smallest mixed-strategy equilibrium values of $G$ are increasing in
$F(x)$ and $\gamma$, and decreasing in $L$, $\bar{u}$, and $\eta$.

Propositions 20 and 21 show that the results of Section 4.3 continue to apply when we allow for
mixed strategies, which addresses the issue of equilibrium existence.